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ABSTRACT. . . - , • 

The purpose of this monograph is to show that 
sufficient ^achievements in scaling applications have been made to 
justify serious study ^of 'scaling methodologies, particularly 
multidimensional scaling (MDS) as a tool for geographers. To be ^ 
useful research, it was felt that -the common method6l6gical and' 
technical problems that specialized -^researchers sh^are wi%h othefr 
sch^ars should be /indicated tiy review, of the applications, and that 
an adequate statement on thet mathematics and heuristics of scaling 
algori'thms is necessary. As a review of applications, tS^biroutines in 
^scaling programs' are- "^dissected" in order to understand how certain 
critical parameters are defined an^ used. This xesearcft work is 
presented in tTiree parts relating to -l) ba:^fc f undameit^taAs , of 
scaling, data, requirements, and Algorithm construction^ and problems; 
2) two step-by-step examples of the non-metric sectiori^^ of a • ' 
multidimensional scaliiig algorit^^; knd 3) a review o£j geographical 
applications of the approach in a^/Srariety of problem, axeas. The 
position of this paper is that M&S provides a- useful and constructive 
m^ethodology for eicamining tho. problems of preference /and cho^ice for 
researchers in geography.* In- conclusion, soae problems of 'using MDS 
are* jfieationed and its potential uses in geography giv^n. 
(Author/ND) . ' ' ^ 
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FOREWORD 



The Technical Papers are explanatory manuals for the use of both instructors and 
students. They .are expository /presentations of available information on each sub- 
ject designed to encourage innovation in teaching methods an^ materials. These 
Technical Papers are developed, printed, and distributed by the Commission on 
College Geography under the/auspices gf the Association of American Geographers 
with National Science Foundation support. The ideas presented in these papers do 
not necessarily imply endorsement by the AAG. Si/igle copies are mailed free of 
charge to all AAG members./ ^ 
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^ * . - <^ PREFACE . ' 

An invitation fru'm the Commission on Quantitative Methods. International 
Geographical Union, in November 1969, *'to pre*pare for 'Multivariate Scaling' a 
detailed discussion of the technique and the reason Jor its use, endmg with a 
step-b>-step wurked^example using real data/* for presentation at a meeting of the 
1 G.U. Quantitative Methods Commission in Poznan m September 1970, appeared 
to us to be an incentive to weave together the patchy knowledge of scaling alg^- 
\ rithms that sve had found scattered throughout manj' periodicals and manuscripts, 
, ' For geographers who look askance at the necessity of becoming acquainted 

with yet another statistical tool, we can only express- our conviction that the 
importance of the scaling problem h-as been too little recognized in geograph) and 
that, insofar as a major area of geographic interest is with choice data, scaling 
te.chniques provide the tools for anal>zing such data. As earlier st>les m g(5ographic 

' res8arch emphasuing structural descriptions give wa> to research stales that, search 

for process laws first and then explore their spatial implications, so techniques thaV 
are suited to (he anal>sis of choice and the preference functions underlying choice, 
must be adupttjd. In comparisgn with the parametru statistical techniques more 
commonly u^ed b> geographers, non-metric scaling techniques allow the researcher 
to"^e more flexible in searching for functional forms and in designing schemes for 
assembling basic data. To take advantage of this flexibilit> is both a problem and a 
challenge. Our hope is that this monograph will indicate to the skeptic that suf- 
ficient achiCvq^ments in scaling applications have alread> been made to justif> the 
9^ serious study of the still developing area of scaling methodologies. 
, < As with an^ basic anal>tical technique, applications range through every sub- 
Tield of ^eograph>. Consequentl> we felt tljat^a review of these applications might 
be useful in indicating to researche*$ immersed in their specializetl area, the com- 

^ mon problems of a methodological and technical character that niey share with a 
far Wider communit> of schuhirs. In preparifig the review, however, w^ felt that an 
, ad,equ|te statement on* the mathematics and heuristics of scaling algorithms was 
' , .first necessar> . Ideall> . ue would have preferred to make a citation to a review that 
alread> existed. Search as we did, we were not able to locate a seview which served 
ouir pufp,o5es. though we saw some references to reviews '*in preparation/' 
excellent treatments of Torgerson [57 1 and Coombs [7] largely predated the major 
developments of .Shepard [50, 51] and** of Kruskal [28, 29, 30}. These, in turn, 
spawned further theoretical and empirical works that we soon found were not 
described adequatel> m print even though their understanding was critical for those 
researchers who depended on scaling in th^ir substantive work. We found that 
ijiformation on this subject was Jbeing transmitted through numerous working 
papers, discussion papers, unpublished dissertations,^ newsletters, ^nd computer 
, program printouts. In our own research we found it necessary to "dissect'' sub- 
routines m Scaling programs in order to untierstaiid how certain critical parameters 
were being defined and used. Thus the work js presented in three parts relating to 
basic fundamentals of scaling, datajequirements, ani algonfhm constructions and 

, / . problems, two step-b>-step worked examples of. the non-metric section of a multi- 



dimensional scaling algorithm, and a review of geographical applications of the 
approach m a variety of problem areas. • 

The willingness of some of the most .prominent researchers in multidimen- 
sional Scaling to send us unpublished manuscripts, reprints, computer printouts, and 
test data, and, in the case of Professor Lingoes, to provide us with a most helpful 
critique of the I.G.U. paper, was a source of inspiration to us to complete the work, 
fn addition, several of our students helped us b> working on some of the program- 
ming problems and^b> runnmg experimental data through the various algorithms.' 
Although W(f had intende^. }o experiment further with different algorithms and to 
report resi^lKs in this pa^er, we found that Lingoes and Roskam [33] and Young 
and Applebaum [62] had admirably designed Wperiments and r.eported on this 
question. 

We thank Professor Jiimes Lingoes of Hie University of Michigan and Pro- 
fessor Forrest^Young of the University of North Carolina for the,materials they sent 
us and for their most interesting communications. We thank also Professor Waldo 
Tobler of the University of Michigan for his .interest in the work, for generously 
allowing us to use his^trilateration example in Section I, and for showing us the 
variety of map transfc5rmations applications of scaling. At the University of lowa^ 
Mr. John F. Hultquist and Mr. Stanley R. Lieber have helped us by testing an(j 
1 using the TORSCA scaling algorithm; at McGlH University. Dr. Gordon Ewing 
helped us by describing and commenting on his use of the Gi/ttman-Lingoes SSA-I 
algorithm, at Ohio Staje University Dr. Ronald.Briggs^and Dr. Donald Demko(now 
at the University of Texas and Queen's University, respectively) experimented with 
unfolding techniques and with the Kruskal MDSCAL series of programs; Professor 
L. Neidell formerly of the Department of Marketings SUNY at Buffalo, drew our 
attention to several interesting spatial applications of scaling methods outside the 
field of geography and providejj tapes *of the entire Guttman-Lingoes series, Pi:o- 
fessor Paul Isaacs of the Psychology* Department, Thf Ohio State University, gave 
^ptical technical advice on tht int^erpretaUon of varic^us algorithms. We also 
acknowledge that the Computer Institute for Soci^/ Science Research, Klicl\igan 
State University, provided the initial stimulus and the programming assistance that 
led to our applying multidimensional scaling techniques iji our own work. Finally, 
we acknowledge our gratitude to Dr. T. Czyz, Df/Z. Chojnicki, and^other Polish 
geographers who created such a convivial atmosj^here for -discussion at the LG.U. ^ 
meeting in Poznan; September 1970. 

• A'cknowledgements are also due to the following people arid (^rganizations for 
permission to^ use diagrams and maps. Professor W. Tobler, University of Michigan, 
Northwestern University Press and Department of Geography; Association of 
American Geographers, Canadian Association of Geographers, Dr. Paul Schwind, 
University of Hawaii, Geographical Analysis, and Professor Peter Gould, Pennsyl- 
vania State University. • ofc. ' ' 

Additional support was granted to Rushton by the Graduate College, Univer- 
sity of Iowa, and the United States Army Research Center, and to Golledge by the 
Departments iof Geogr^hy, University of Sydney, Australia, and The Ohio State 
University. ' ^ - ^ . 
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I. AN INTRODUCTION TG S6ALING AND -SCALING^ 
ALGORITHMS 

*. A. Scaling 

The fundamental idea of scaling is to produce a range, of scores that It^ve mean- 
ing either with respect to each other*s values or to some arbitrary or absolute value; 
<§et or accepted b> the scale. A scale generally consists of a system of numbers 
related b> ^orresponcfence rules which enable meaning to be attached to the objects 
possessing them. For example, a number system is .a scale which can be nominal, 
ordinal, interval, or ratio in nature [27, pp. 9-12]. The explication of number 
system involves itemizing the correspQndence rules which give meaning to each 
number in the system [41]. Thus, we can envisage that potentially there is an < 
infinity of ways to scale data, generally, however, the scaling problem reduces_to 
one of devising rules for the measurement of a construct or phenomenon such that_ 
the resulting measurements provide an easily interpretable and admissible trans- 
formation into numerical fom^f the phen\)mesnon being scdled. 

The advantages of scaling aret Similar to those which derive from 4he measure- 
•ment of* properties in general: > « , • 

1. scaling makes it possible to differentiate among instances that may be lumped 

together in a given class of things (e.g.. degrees of "warmth*')i^ v 
2ff scaling can sl^ow^ relative position rather than^ust difference; and 
1. pealing allows systern'atic manipulation .of the 3»-aled itejtis in conformance 

with concepts and theories of logic and mathematics. ^ ' \ 

Scaling methods are genefally subdivided into two classes unidimensional and^ 
multidimensional, Unidimimsional scales are those which measure variation mth 
respect to one attribute (e.g., color hue, population size, neigh*borlintss, per capita ^ 
income,, itsidential status, social rank, degree of urbanization, and so on). For 
example, "Berry '"and^Gihsber'g^ [4] constructed a series of unjdirnefh^ional scales 
measuring a variety of urban, economic, and societal characteristiijs of various, 
countries of the ijtorld and used those scales in a multivariate analy^is4of levels of 
/^economic development. Similarly, ^he urban rank size^nile an(^ie indices of 
^ ^primacy developed by Linsky [35] and Mehta [40] are unidimensid^ scales based 
Jrn population-size which provide a range of scale values. Ho\Ve\^'c', dc^ite the 
''widespread use of such scales in geography, we must be aware of tfie problem that 
1 systematic variation of the scaled phenomena with respect tq, ntpre than the 
'assumed number of attributes may be undetected and may lead to'tkifficulties. in 
. interpreting the results, or to a rejection of experimental res*ilts. Examples mi^t 
include divergences between actual migrant moves and scalfes expressing the "desir- 
ability'' of plac65 of settlement, or variations in the attracting power of place3 
resulting from variations in'thejimriber of functions rather than Just population "Size 
• [3]. ' - ^ ^ * * 

4 Multidimensional scaling aims at developing procedures which will ^assign -sets of 
. numbersMo various quantities of attributes such that the numbers directly reflect 
^ variations in the quantities of the attributes among the phenomena being scaled. 

' ^ ' ■. ■ . . • 
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' For example, distinctions may be callecj for between objects which vary with re- 
spect to color, locatio;!, shape, size, and so on. Sugji ^bject» form a multidimen- 
sional series/and the scaling procedure attempts to idewtify a number of relevant 
^ dimensions of these "objects. To do this, data are collected to permit the detection . 
of s>stematic variation with respect to morp than an. assumed small number of 
attributes. For example, if scafing fs used an evaluative device, a subject is 
generall> instructed to consider the similarity or dissimilarit> of stimulus objects 
(considering all relevant attributes),Tather than being asked to rate ©bjects accord- 

♦ing to the magnitude of some specific attribute. Specific examples of these pro- 
cedures are discussed later in tHq^aper. ' ; . • , ' « ^ 
Recognitior> that an> object may have a number of attribufes, and tha't different 
, , attributes ma> be used b> different^ individ^uals in their attempts to scale^'the objects 4 . 
in some wayr^led tb the .condiision that arfy given object could be regarded as >-s. 

. existing in an /i-dimertsionaI. space, where n represents the nuijiber of*p^eived or 
actual attribute. The quantity pf each at{rib#fe belonging to an object can then be 
interpreted as a geometrical coordinate which, when used in conjunction with other , 
quantities (coordihatesj, determines the Igcatron of eachobjegt in 'the,;?- , 
dimensional space. The significance of this is, that if individual objects can be thus 
located, then interpoint distances can be calculated and objective statements can be ^ 
derived concerning the distances separating \&rious objects. 

There pre sohi6 important points to conSiaer about this geometrical interpreta- 
tion of the multidirriensional scaling process. First, consider the case where ^objects 
are located in an w-diitiensional real Euclidean space. Here each number associated 
with an attribute gives the proje,ction of the object on one of the cpordinate axes of 
thp space, in other wordi/it allows us to determine the distance of the object from 
an ongin along a given axis. Tha distance between two^poii)ts located in this ^pace js 
given as follows: . -i " * 

" * ' w^iere j and^k are the two points, - ^ * 

" ' {r fe an ind«x of the ax*es, ..^ 

/nis the number of orthogonal axes, * - 

.1^ , and p ., Pj.,^ are the projections of the paints on acxis f. - 

What is particulafl> important is that given th^stanQes' between all pairs of points 
in the space, the:projectiqins of the ptiffits on any arbitrary set of orthogonal axe's in^ 
the space *tan be determined. In other words, given interpoint distances, we can 
recover the number of dimensions in which the points exist. For any set of . inter- 
p.oint distances there will.be a sTpace of minimum dimensionality in which a^^atis- 
factoril> large number of the interpoint distances maintain their relationships one 
^ to another. One 0/ the .aims of multidimensionaLscaling is to iderirify this space of 
minimum dimensionality and to interpret each dimension in terms of srimulus 
. attributes. * ' \' ^ ^ ^ >^ 

\ - The second critical feature of multidimensional scaling is that it is not necessarS'^ v^* ' 
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to have metric information on the interpoint distances as input. Because subjects 
are not asked to make decisions with respect to a given number of attributes, they 
are free to choose any number of attributes Jthey desire in order to make distinc- 
tions between objects. Thus, instead of imaginlhg thaf each' stimulus object has a 
legation in a real Euclidean space, the subject ^OQjtes each .object in a ^psycho- 
logical space" whidh may exist in a quite different form of geometrical space. In 
addition, subjects locate points in Xhe space merely in terms of their being "nearer," 
"greater than," **more similar,'' "more preferre d^ ( and so on) to any given object 
than they are' to other^The aim of multidim^Rc&ial scaling is to take \he data 
collected with respect to stimulus objects and to recover from these dafa a spatial 
configuration of points in an identifiable space of minimum dimeiisionality. 
■ ' ~L ■ \ ■ , ' _ ' 

*B. Data Requirements and CoUectioit' ' 

» It. was sug^sted abc^e that multidimensional scaling procedures are versatile in 
that they can takers input either metric or non-metric datV^Since discussions of 
the various types of metric data can be found in anyi)asic mathematics or statistics 
book, emphasis will be placed herd ph using'ndn-metric information. 

Ct56n>^jj7] ' argues that there are ^ur. phasic kinds of behavioral , data—., 
preferenfial cHoice, single'Stimulus, stimulus-comparison, and sinlilarities data. 

A^ume that we have a sample of individuals and a collection of phenomena an(! 
that the individuals are asked to state their preferences for the phenomena. The 
.instructions may be of tha following type^: \^ 

1. Choipse one out^ a set of phenomena. iT* 

2. Choose oiit oFa set of w phenomena. > - ^ 

3. Choose one of a serieS'<2fsubsets 6f the n phenomena. 

4. Choose one out of every possible pair of the i^henomena. 

When sample n]emberS perform *one or^anothe/ of the'^^above^ tasks they state iheir 
preferences, for the phenomena chosen, and the data collected are called preferen- 
tial choice data. For this type of data, we assume that the set of stiniuli and^ the 
^individuals revealing .their preferences can be mapped together into a joint space. 
This joint space i^ a psychological sp^ace, ^nd both stimuli and individuals are 
mapped as points in such a way that the relations among the different points in the 
sp^cc reflect the observed preference orderings of the individuals. It can be hypoth-^ 
esized that each of the individuals who H3s'b5en asked to state preferences among 
the stimuli will have different ideals^as to what an appropriate combination of , 
stimulus attributes should Thus, if a particular s|imulus has more or less of a 
particular attribute, individual preferences might reve^ it to be more or less desired. 
Through a' sequence of preferential choice statements, an individual reveals where 
each stimulus point Ijes with respect to his ideal point. We can imagine that the 
same basic dimensions are used by most individuals in order to make judgments 
about 'the stimuli but that^each individual may prefer different quantities of each* 
attribute. Thus, the attributes that are perceived in any stimulus wilfbe mapped 
into the space in proportion to whether or not an individual desires that particular 
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attribute, in other words, in relation to where his ideal p(?6^ lies with r(Jspect to 
anV dimension b> which a given attribute is presented. Given a sample of individ- 
uals and a sample of stimuli, the's|ImuU may^tee^mpped into the same space as the 
individuals. The problem then is to^iFKh'how the indiNidiials and tlie stimuli can be 
mapped into a spa^e of minimum dimensionality tu reflect the preference orderings 
of each ofjhe indiviSuals, ^ 

This particular problem becomes complicated somewhat when we realize that 
when chousing .between pairs of phenomena, for example, subjects will not always 
reveal the same preferential ordering. Indeed, in an, experiment where a large num- 
ber of preferential choices are made, each individual may make intransitive choices, 
that«is, he may prefer A to B, B to C, but C to A. If there are no such intransitiyities 
in the preference ordering, thQn stimuli can be rank ordered for each individual 
from most fo 'least* preferred ^nd a unj^dimensional scale for that individual can be 
compiled. WTiile the_/^roblem still remains to fit together alV the individual uni- 
dimejisionaLscales, it beconles far more complicated if any given individual has 
intransitivitiaf ili*his preference orderings. Intransitive preference rankings (or 
preference ofderings) of objects cannot be arranged in simple rank order. Intransi-* 
live preference urderings m^y occur when a stimulus object with multiple attributes 
evokes in 3 respondent an ordering based on a pa[ti|:ula/ dominant attribute for one 
comparison, and then evokes a rank ordering or\ the basis pfa different attribute 
for a second comparison. For example, candies, may be considered to have the 
attributes of brittlenegf, chewiness, quality of chocolate, qr type of filling. Given 
any twQ candies, an , individual may compare thdniTrorti the point of view of 
brittleness but when comparing eithef of them with another, an alternative attri- 
bute such as type of filling may b.e used as the distinguishing criterion. 

Cooinhs' idea with respect to thii preferential choice problem is to unfold the 
space in which preferential 'choice^ are being made^uch th^t both individuals and 
ol^cts are mapped as points in a joint space and the mutual relations am(5ng the 
points reflect the observed prefefence orderings of the vanous individuals [7, pp. 
^80-192). The unfolding .of choices is not our/najor concern in this paper, biitft 
does provide one basiJ for the multidimensional scaling algorithm developed by 
Coombs, which is referred to later in the paper. ' » 

The secon^ tyj)e of datit thdt Coombs defines is single stimulus data. Her^vWe 
assume that our sample Individ i\gls are presented with a set of homogeneous stimuli 
(i.e., stimuh from a single populati6n such as politicpl candidates, supermarkets, 
etc.), only now the individual is asked to make a judgment abput each stimulus in 
turn*. For example, one may be asked whether or not h^^cJiil3 vtit^or^a certain 
politician. In this' case, if we again regafd each individual as h^ing an ideal point 
then we can imagine that each stimulus when presented to the individual is said to 
ho either within the^. ''neighborhood'* of his ideal point or outside this hypothetical 
neighborhood.^This would generate his, yes-no response. These. data are sometimes 
described in abstr^act terms as "proximity relations" and are some.what different 
from the order relations expressed in preferential choices. 

A third type of data arises when we ask individuals. to^determine an order 

' " " . 4 ' ' 
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relation on pairs of points from the same data set: these are called stimulus com- 
parison data and the miethod of collection is generally the method of paired com- 
parisons. Consider the^'candy example mentioned previously. Assume that ,we sug- 
gest every possible pair of candies to our subjects and that we ask each indvidual to 
judge between the two candies on the basis of.some actual or perqeived characteris- 
tic. In the simplest Of all experiments we would simply choose a single attribute 
such as brittleness, for example, and ask for comparison between the candies on the 
basis of this attribute. Judg?Tients would therefore be presumed to reflect dif- 
ferences among stimuli and not among individuals. If, ho\yever, we find that the 
individuals do not agree with respect to their interpretation of tfifeNat tributes, then a, 

^ frequent technique is to subdivide the population such that the groups created are 
^homogeneous with respect to ^ their perceptions of the attributesVsNote that in 
comparison to the two previous procedures th^ individual heed not b^lfttecoreted 
>is a point in the same space as the stimulus. Rather, the^comparisons betweeh the 

.stimuli enable us to locate them iri the space of some dimensionality such thatW 
distance between each pair of points in the space is interpre^ted as an indication o| 
the similarity or dissimilarity of the objects. Short distances indicate high degrees o| 
similarity and large distances denote dissimitarity. 

Thfii method of collecting paired comparison data can differ somewhat from' 
expeiiijient tq e?cperiment. In stimulus comparison experiments^ frequent use is 
ma(ie of paired comparison, triadic comparisons, and* comparisons of pairs of dyads 
[7, pp. 3-59; 444-462] .The typical paired comp^arison experiment involves Subjects 
being. given ill .possible p^rs of objects and then being asked for some type of 
comparative judgment related to -the pairs. If it is assumed-^that self-similarities (i.e., 
comparing .the object to itselQ are ignored and complementary comparisoris are' 
equivalent (I.e., ArB.4s assumed the same as B-A), then there are -n(n-l) pairs of 

objects, for 'wKicfi!somfe type of comparative judgment is requested. The ultimate 
» ^ aim is to obtain the'ordenng of the paired objects ^upon the* basis of some psycho- 
lo^cal continuuik The procedure is based on one^of the fundamental principles of 
the Law of Comparative Judgrne|it.*This states that any. given stimulus has asso- 
ciated with it a most frequently "liroused or modal discriminal process on sonie 
continuum. J t js accepted thit any subject may choose different attributes of the 
stimulus object when comparing the object with others,' but it also assumes that the 
* discriminal process (or reap,tion of the subject to the stimulus) is distributed nor- 
mally around a mode which can be called the scale value of the object. Thus, any 
two objects may differ with respect to tlieir scale values. The purpose of the paired 
^^omparison experiment then is to allow the scale values fof any two stimulus 
objects to be compared so that statements can be made about the degree pf simi- 
larity or' difference that they evoke in any respprtdent. By finding the frej^ncy 
with which stimulus / exceeds, equals, or is rated less' than stimulus; wProain 
judgment as to the relative magnitudes of their respective scale value? 

in stimulus comparison experiments, the stimulus attributes upon^Wch judg- 
ments are made are clarified for ttie individual. If we permit the subject freedom of 
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choice in companng stimuli, this leads to^jhrdev^Topment of a fourth kind ojdata 
called similarity data. With this type of data the individual is presumed to peiceive 
each stimulus Ss a( union offsets of attributes. Again, it is assunjed that the stimulus 
Js able to be represented b> a point in space, the coordinates of this poin: cor- 
respond tb the projections of the stimulus on tfie various dimensions whici the 
individual chooses as being relevant. The object oT making this l^pe. of comparison 
is to attempt to determme the minimum number of dimensigns which are us<Jd by 
individuals when comparing objects. For this type of data, comparisons are nade 

• usually by asking subjects if one pair of stirjiuli are more nearly alike than another 
pair. In other words, ^we attempt to find out whether the distance separating; one 

« pair of stimulus points is less than, equal to, or greater than the distance separating 
another pair of corresponding points. In a sense, an.attempt'^is made to determine 
an order' relation on the distances implicit between pairs of points where al the 
points are from a single homogeneous set. Multidimensional scaling modeh are 
commonly." used with similarities ^ data in an attempt to construct some typs of 
stimulus space from measures of tl^e interpoint distances between phenomena. 

Of the various^^types of data available, geographers have experimented to sbme 
extent with each^on^al though the use of paired comparison procedures for colljpct- 
ing data is becoming soipewhat more popular. Examples of such experiments 
include asking consumers b select one of a pair of shopping centers, towns to \ljisit 
in order to purchase given go^, or towns which would, be selected for purposed of 
migration ..The freqifency witlt-whigh any given pair m'ember isxhosen over othWs 
is then recorded [5,4^] . For example, Rushton jijterpreted the movement tof . 
farmers to towns in Iowa as*^e%utcome of a pl^oice process which could be 

, inferred to be a paired compar|^ type procedure.^Visits to each place were tranV 
formed into dissimilarities measures, first, by recor^ding the number "of times town 

'''^type / was chosen (for a^specified shopping trip) ov^r town type / when both / and / . 

^. we/e present in a feasible area; secdnd, by regarding a proportion of 0.5 as being the 
maximum perceived similarity, apd then by fir^ying the difference between the 
derived proportions and this maximun\perceived similarity: 



(2) 



where d ^ is a measure of dissimilarity. Here a sfhall value for d.^ represented small 
dissimilarity between towns, and a large valfie indicated considerable dissimilarity. 

Most multidimensional scaling analyses use ooly the upper half oi a paired 
comparison matrix. Thus only n(h-l ) comparisons are used as input. Most comput- 

• ing algorithms, however, allow optional inclusions for diagonal and lower half 
matrices and they can be' used either with complete or incomplete data. 

C Approaches to Npn- Metric. Multidimensional Scaling (MDS) 

It was j)reviously suggested that implicit in every collecron of proximity 
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measures (such as/measures of sunilyity and dissimilarity) is a ki^a of spatial 
structure. The basic problem oj^MDS is to unc6ver this structure. While it is general- 
ly agreed that greater degrees of similarity infer closer distances (and vice versa), the 
fonner are only; ijfnplied distances and ma> not easily be transformed into metric 
form. However, in Shepard's terms: "If some monotonic transformation of the 
proximity me^ures could be found that would convert these implicit distances into 
explicit distances, then we should be in a position to recover the spatial structure 
contained only Fa^ently in the original data" [50, p. 127] . 

The various approaches developed by authors such aS' Coombs, Torgerson, 
Shepard, Kruskal, McGee, GUtlman, Lingoes, and Young represent attempts to 
recover the latent spatial structures contained in proximity-type data [7, 57, 51, 
29,38,39, 18, 32;63]. * * ; 

The relative advantages of the non-metric approach in searching for'latent spatial 
structure have been summarized by Lingoes andtJuHman: 

One the chief benefits to be derived from const^ining the solution non- 
metrically, is, of course, that in general a smaller space is required to reflect order than » 
to reflect metric, Qf greater importance, however, the dimensions themselves may well 
aid our understanding of the underlying interdependencies free of the attenuation that 
can result froan Tion'linear relationships, furthermore, when some lawful structure or 
pattern, is present in the data, e.g., a ^simplex, a circumplex, or a ra^dex, a npn'metric 
analysis vWU reveal the configuration whereas a metric approach will obscure the law- 
fulness (32, pUj8l7] . 

1. The basic Merijferits of non-metric MDS algorithms. Although there are dif- 
ferences amon^ the algQpthm^^urreptly in use, there are also broacj similarities in 
ternis of their constructjon^., Features common to the majority^f the techniques' 

include:*:':- J ^ ' T ' . 

a) an initi| Set of input data, frequently generated by a paired comparison 
. experiment, within which is contained a latent spatial structure (such as 

-dissimilarities data). These input data c.an be^priepared in random vector 
mode, or in. the form of a symmetric matrix, a rectangular matrix, or a 
, triangular matrix. 

b) an initial configuration of interpoint^distances which is manipulated on 
^ successive iterations, in-an attempt to define a .monotone relationsWp ' 

between the configuration and the original data. 

c) ^ computing algorithm (a non-nietric scaling method) which incorporates 
* the strategy for achieving convergence of the. data and the configuration. 

d) a. ios? funcrion. (or "goodness-of-fit") function whicl] is used to 'guide 
and/or terminate the iterative procedures. . * 

e) su'broutines for handling missing data and tied data, and for determining 
step size motions within each generated configuration. ' 

*0 technfques for estimating the configuration Reformation a&lhe number of 
"J^^t^n^nsions in which the configijration is'plotted is changed.^ 
^-'^A^enerdlized format for^a non-metric MDS analysis of complete data has been 
• provided (as follows) by Lingoes and Roskam [33, f)p. 1 1-16] . '* 

\ ' « , 

00018 




' Let A = a /:-element array (or vector) of arbitrary indices of dissimilarity 
between all pairs of n-objec ts, k = n(n- 1 ) /2 ^ 
6jj= thegeneralelement of A, (i= 1,2,. ..,n-l\ 

j = i+l,i+2, .,..,nj 

Z ^i, ^"^ Ignore therefore s>mmetric elements and diagonal elements. 
Define D as a ^element vector of real numbers with elements d . = f(P..) such that, 
when6..<5. ..eitherd. <d. ,,ord . 

Note that- d,j < dki implies semi-strong monotonTcit> when some A-elements are 
tied and strong monotonicity with no ties, and d,j < implies weak monotonicity 
for\o ties and semi->veak monotonicity for ties (the following section discusses 
. monotonicity). 

Assume D A monotonically: now D is a mpnotonic transformation of the A- 
vector whose furiction is to weight the iterations for moving a configuration toward 
its goal and to form a basis for evaluation of goodness-of-fit at any iteration. 
Define X an an x m matrix of rectangular coordinates (for a given configuration) 
wilt w representing the number of dimensions. 

Define D as a /:-element vector of distances calculated from X between the /?-points 
embedded m a given space according to the standard distance formula given earlier 
as equation (1), The general problem statement can then be formalized as follow^, 
^ven A, an initial confi^ration X, a fixed n, and distances calculated between the n 
points, try to get D as close as possible to D (i.e., minimize some loss function), for ' 
then D will map into A. ^ 

The general procedure is to: ^ ^ / 

0 determine an initial set of coordinates (X) and select an appropriate 
dimensionality (m); , 
j 2) compute the Euclidean distances (D);, 

j 3) solve for D v the* predictions of the appropriate distances 

' {i-^'d..=f(6..)}; ' ' 

4) .compute the normalized loss furiction (e.g.. STRESS); • 

5) if the loss function is small enough (or not changing "sufficjiently'* from 
on(? iteration to the next), terminate. Othervyisc modify X and return to 
'step 2. . 

At this stage, it is pertinent to discuss some of the basic ele^ments of MDS" 
approaches as a means of explaining some of the rules which govern algorithm 
construction. In particular, we plan to focus on the requirement of monotonicity, 
methods of deriving initial configurations, goodness-of«fit criteria, treatment of ties, 
and some brief coniments'on dimensionality. Summaries of a selection of 
. approaches to MDS will then indicate some differences in the strategies used to 
recover configurations from non-metric data. . i 

2. Mpntonicify requirements. The essence of MDS algorithms is the requirement of 
maintaining a monotonic relation between the original dissimilarities data and the 
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I'which afe deri\ed to represent them. While it may seem at first thatTfie 
Son of the monotonicity restraint is a weak one, Shepard has argued that. . 



If ,non-metrrc cunstAiints arc imposed m sufficient number they begin to act like ^ 

ni^c, constrain^ In the case of a purely ordinal scale Jhe non-metxic constraints are J 

reli^cly few qjnd, consequently, the points on the icale can be moved about quite ^ 

cx^^^lg without violating the inequalities (i.e., without interchangiag any two ^ 

po^^^As these same poin'ts are forced to satisfy more and more inequalities on ihe , 

in^F^iRt distances as well, however, the spacing tightens up until any but very small , 
pcmifbauons of the points will<ictually violate one or more of the inequalities. [51, p. 

^^F' 

Thu^i|^we impose a monotonia cpnstraint on the relationship bet^wee^ a set* of 
• dissiimK^^ measures and a^et oWistance measures, we are in effect assuming th^'t ^ 
the ral?brder of dissimilarities is by itself enough to oUtain a solutidn-i e., tP 
obtah^^^ent spqtial structut.^. , | \ 

Mp^^^icity requireme.nt's can,' be specified in ^ number of ways! (18, pp. 
48(MMl^Lct R be the, number of distinct values among the off-diagon^ elements 
of a ^^p^onsisting of an initial set of distance ranking numbers; R =^h(n-l) if, , 
and c^^l^ there is complete information (i.e., no missing data) and no ties. Where 

each i^mlfer is tied only with its transpose in a symmeUic matrix R < n^n-l); R = 

m^-^ ^' , — ^ — 

n(n-.^^here information i§ complete and the only ties are with the ,*transpose. 

• ■ ^ . • V 

Now!&f^fea trial matrix of distances, let Q be the numbet of off-diagonal elements 
5vhicMare specified (i.e., not rnissing). Because qf the symmetry of the data, 
Q < n |C%l). This allows for the definition oC three tyj^s of monotonic conditions 

based^^^ pptential presence or^absence of ties and/or missing data: 

^^^S? R, strong monotoniciiy is said to exist (i.e., no ties) ; 
2W^^ R, s&^hstri^ng monotonicity is ^id to.exist, ' 
^^^^ monotonicit^ is saidjo exist. ' . . 

"li^^neral, permitting the removal of ties enables a smaller spac^ to be attained 
thariS^'ferjvise And most algorithms adopt conditions of either Q > R pr Q < R. 

Tt^^^J* objective of the monotonicity constraint is to ensure t*hat the rank 
ordei^ 
cal > 

- similarities data). Thus points with the smallest dissimilarity nfeasures 

shouB&^nd up with the smallest distances between them in the final spatial struc* 
tur^S?^ perfect monotonic relation would involve exactly the same ranking for 
corresponding pairs of dissirtlilarities and configuration ^distances. 

be a measure of dissimilari^ty between n "types"^of stimuli [28].' For a 
' such dissimilarities the intent is to represent the n types as /z ppints in / 



^lerpoint distances obtained from some cfonfiguration of points isidenti- 
M the rank order of 'the original dissimilarity measures (or inversely mono* 
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^dimensional space, \\^erein 'the interpoint distances (d..> correspond to the ob- 
served degrees of dissimilarit> between^he n types. Perfect correspondence would 
mean, for example, thatjf type r is more similar to type / than it is to type k then 
the corresponding interpoint^distances would "satisfy^the same«relationship^f6r all /, 
/, that IS, where 6 > 6^^, d. > In other words, if the Vocational types are 
shown on a scatter plot in which the ordinate is dissimilanty (5) and the abscissa is 
distance (d), then as the' points are traced one b> one from bottom •to top, the 
move is always to thfe^ right, never to thejeft. \Vhen this requirement is met, a 
monotone relationship bptween dissimilfrity and distance has been found. 

In order to^ obtain appropriate rank orders for the distance measures, a piono- 
tone regression i$ performed and in this way, by minimizing the sums of squared 
differences between the derived set of numbers and the dissimilarities, tt is ensured 
^ that the set of numbers representing the distances are *'as much like" the dissimi- ' 
larity measures as possible. In essence a monotone regression between dissimilarity 
and distance measures requires that only the distance measures be moved at e^ch 
iteration (since the dissimilarities are dimensionless numb^). Differences between 
MDS algorithms arise when different cnethods are used to define (he transfornied 
distances and to perform the movement of Oie points. The critical problem is to 
determine the direction and magnitude^of any mpves of the points that have to take , 
place; the two most favored methods are the "method of steepest descend" (Or 
gradient method) and the "me'thod of coTrecJtioH matrices" (29, 18] . 

3. Treatment of ties. Kruskal provides two options for the treatment Q^ties. The 
first or primary approach treats ties as an indeterminate order relation v^hich can 
arbitrarily be jesolved in such a way that either STRESS or dimensionality can be 
decreased The secondary approach regards ties as being evidence of an equivafence 
^el^tion. which^should, as far as possible, !be maintained-even iP»the result is to 
-.increase dimensionality or stress [33, pp. 36-37]. ' . 

In the primary approach-the one which B^skal originally preferred [20, p. 
22] -when t\yo dissimilarities were^equal (i.e., 6.. = 6^^) it was argued that it was 
no great concern whether d.. or di^^ was the larger, or whether in fact they w6re 
equal or not Thus if d ^^^=(1,^^ there was no pressure to downgrade the cpnfigm^'tion, 
the inequality was not reflected in, the stress value, and no constraints .were placed 
on the estimates of distance (d & d^^). The terms (d.. - d. y and (d^^ - d^^)'^ ' 
were therefore permitted to be zero (subject to the influence of any other existing 
constraintsj. Whenever d^^ < d^^, the (transfornied) distance estimates had to'fpllow, 
the same monotone relatronV^^ could be equal (i.e., d . < dj^^). 

The secondary approach on the^other hand argtieU that if 5. ^ 6^^ then either 
^ij " ^ki configuration should be downgraded until the equality holds. Thus « 

estimates of the distances must be equal (d]j = d^^,). If ^i^'^d^^ then (d..- d..)^ and ^ 
(dki - d^^y are not zero and are reflected in the stress valub. In other words,\' 
^Whenever6^j <6^jthen d.^ < dj^^*, and whenever 6.. =5j^'j''then d. = d^^^. 



'The essential difference between these two ways of'treating ties is in, the arnount 
they comribute to stress. In the primary approach, tnl ties will contribute nothing 
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to stress unless th^y are merged with preceding or succeeding blocks^by^virtue>oTr 
the,monotone regression thatjs perforfnerf. The secon1[iary approach.iijeffgot^ses 
the me^ of the distances X^..) For the tied blocks, artd thus they remairT tied 
"Whether or not the tied blocks ^re ftierged. with others djuring tfie monotone regres-^ 
sion. The {primary approach appears to be the most^widespcfead in the major non- 
metric MDS algorithms, and a recent study^ has coocluied ^that the primary 
approach is to be preferred (33, p] 136] . * /'"\ .: 



4. Goodness-of-Fit. The problem of determining the "best fitting" confi^rjtibn of 
derived distances is also important in all the various approaches to MpS, and tiie / 
method of resolving it varies from algorithm to algorfthm..F6r example, Kijfskal 
[28] Uses the normalized residual variance from the monotone regression to define ' 
a measure (calI?3^TRESS) which forms the basis^for determining the direction ^uijl 
magnitude of distance movements at each iteration, and which when minimized, 
gives an estimate of go6dness-of-fit of the final configiration. Other goqdpess-of-fit 
measures include "squariance/V'coefficient of alienation," and **WORK** [61, p. 9; 
32,p.489;38,p. 185]. _ ^ ' '^"^ 

The index-of.fit STRESS is used both in the Kn^skal MDSf AL series of non- 
metric MDS algorithms and the TORSCA routines. Basically it is the normalized, 
residua] variance derived from a monotone regression of distance«and (jissirrularityC^ 
The monotone regression in th'is case involves moving only the horizojital^distanceji,. 
measures in order to compute a line of best fit. This requites first matching the 
dissimilarity and distance measures, then, checking each distance measure to see i|*it 
is greater (in a jnonotonically increasing relation) or smaller (in a^monotvinicaily* 
decreasing relation) than the preceding distance, MonotonicVansformations of the^ 
distance v^ues, (called disparities in the worked example mh Section II) afe then 
made to satisfy the inequality conditions in the input matrix. If a^set oT distance 
(•d^) values can be generated su^ that, when they are arranged fn vector form, tjieyi. ^ 
occupy the sam*e vector position as their corresponding dissimilaifty (Sp measui'es, 
^ {hen "perfect match*' is said to bc(jur and stress is zero. . * * ^ : 

The transformed distances (d.j) may be regarded as the* set of nunabers that 
depart to the minimum degree from the corresponding set of computed distances, ^ 
(d..), while ensuring that their rank order is the sarne as that of the ori^nal dissimi- 
larities. Thus if we were to plot a curve to the scatter of (d.j,6.p potnts,'the curve 
should , move vertically or to thff right, never to the left. Since we wish ta work ojMy^^ 

with the rank order of the 6.;, 'stress focusses on the difference between the corn- 
er \ ^ - ' .J 
puted distances and the transformed distances, (d.^ - d.p. This pnsures that any 

monotone distortion of the dissimilarity axis will not affect the goadne^*of*fit of 

the configuration. Thus, if an original point is located at (d.j, 6^^, the point qorre^. 

sponding to the transformed distancerhas coordinates of (d.j,5jj)., "!^itting the 

curve,*' in this case means no more than fitting M djj values (M = n(n'l) /2). Nojte- 

^^^j^at the d^ are not distances from any configuration, but are only a monotony; 

^^^Sequence of numbers chosen as ^nearly equal" to the original d.^ as possible arid 

^ having the interesting property that their rank order corresponds to that of thi 6... * 

. - . ■• ■ • . ■ ■ ^ ., . , 
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Having determined a set of Ffumbers jnonotonically related to the set of d,,, 
f define raw stress as follows: \ ' ^ ' ' 

Ra^Stress = S* = ^ ( - . (3) 

Except for normalization this becomes the *goodness-of-fit measure. Raw stress, 
while being invariant^ under the ngid^motions of configuration (rotation, transla- 
tion, and reflection), is not mvariant under unifonti stretchmg and shnnkmg of the 
configuration. To compensate for this, Kruskal originall> divided raw stress by a 
scaling factor (T*) which , was simply the sum of squared distances. .Taking the 
square rootbf the final index led to STRE.SS being defined as: ^ 



5. = [,|j.Cd,--.3„)'/^|._d/]'". 



■(4) 



Ht^wever, m later works, Kruskal decided that rather than standardize raw stress 
simpl> in relation to the sum of squared distances, it should be standardized in 
terms of squared deviati6ni about a mean distai\ce. The alternate stress value pro- 
posed, therefore, was? 

, ' where d.^ is the mean of the distarrqe scores. 
Kruskal sugg^ts the folloviing verbal evaluation pf ^oodness-of-fit: 
TABLE i. STRESS EVALUATIONS 



St:ress(S^) 


(S^) 


Goodness*of-Fit 




40% 




Poor ^ 




' 20% 


fo% 


Fair 




\0% 


5% 


' Good 




. ' 5% 


2V2?0 ' 


Excellent ^ 




0% ^ 


0% 


Perfect- ' 





• Source: Kruskal [2a, 29]. 

^ Initial configu^'tions. Since bur worked example follows the Kruskal-Shepard- 

Tcy-gerson (K-S-T) mode rather than the Guttnian-Lingoes smallest space analysis 
iff^pd^, discussion of initial configurations will be limited to the options available for 
t'hfe^S:^rou tines. . . • s \^ - ^ 

> ^^lyjal configurations may be generated entirely without bias (i.e., randofnly) or 

maj'.^wil^dffliberately structured in an attempt to hasten Convergence of data and 
generated configuTtition. Two major alternatives are available in the Kruskal series 
of programs: • , . *i ' . - . ' 
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i) the input of a configuration "of one's choosing" (which may be^ arbitrarily 
selected): 

ii) generation ot a pseudo^random initial configuration. 

^ Tht Young'Tprgerson routine, on the other hand, uses a semi-metric method of 
.defining an mitial configuration. In this method the original dissimilarities data are 
[transformed to scalar products (U.^) using, * 



p 
LVk 



P 
s 

k 



^kj 















[(■ 













(6) 



^wficre S J is the origina l^similarity of points i and j, and f is the number of points 
(33,, p.^ 1 30) assmning the "tru^ dimensionality of the points equals r, the r 
Jar^est eigenroots of the Matrix U are extracted. Each vector is then multiplied by 
Ithe square root of its eigenroot, and a new s^t^of distances is calculated according 
lo the general Minkowski formula: 



d.- = 
ij 



'I 



± 

m 



(7) 



where is the i*^ scaled entry in eigenvector (a),*and w is the MinkoNvski metric 
fnumber selected for the problem. * , " ^ 

Thus the original data are converted to scalar products, these scalar products are 
^factor analyzed such that the datli are produced using only the first r-dimensions, 
. and thqn, finaJl>,'a monotone transformation of distances is found whicfi best fits 
,.the origin^ idea. Here "best" is determined by an index-of-fit. In fact/ the best-fit. 

problem is viewed as a regressiqn problem, with a monotone regression of distances' 
j and disparities, yyhere /'distances" (^.p. are the measurements, produced from a 
J given Lunfigurati<)n b> apply mg the Minkowski distance formula, and ^/disparities" 
are the munutoniLall> transforitied distances (referred to ia,J0u^ worked example by 
the term d..). - * ^ 

The disparities prpduced j)y the above method then becomo-^the basis for a 
second factor analysis anc( the whole process is repeated. Thus, the Young* 
Torgerson routine uses a "semi-metric" algorithm to define an initial configuration 
prior to its manipulation by^the non-metric algorithm. The routiije is called semi- 
metric because it performs the metric operations of multiplication and addition on 
the original (dissimilarities) d^ata during the first factor analysis, but uses a non- 
metric monotone transformation (the disparities) for each successive factor 
analysis. ' . ^ * 

6. Dimensionality. The final determination of the number of coordinates recovered 
for the data rests ultiniately with the experimenter. To assist in finding an appro- 
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priately dimensioned solution, the following altefnatives have been suggested [28t]. 

a) Undertake the analysis in several dimcnsions^nd plot the relation between 
stress and dim^nsionality.'Generally some nbtie^le./^elbow" will occur in 
the curve and this should indicate the appropriate number of dimensions 
(Figure D; * ' / 

b) If some given t-dimensional solution, for example, provides a reasonable 
solution (in terrfis of stress size and in terms of the ability to interpret 
coordinates),_and if a (t+l)-dimensi6nal solution produces rjo major im- 
provement in interpretability, then the t-dimensional solution should be 

used. * " [ 

*■ , » 

c) If there is an independent estimate of tfce statistical error of the data, then 
this gives one an idea of the appropriate number of dimensions to extract- 
in the sense that the rnore reliable the data, the more dimensions one can . 
safely extract, • . 



^STRESS 
(X) 



35 
30 
25 
20 
15. 
10. 
5 




Selected (t) 



u 



* 1 



/ 



10- 



DIMENSION (t) 
•Figpre I. Shepard Diagram 




7. A metric example: recovering point configurations from interpoint distances. In 
an unpublished paper, W. R. Tobler has developed a worked example for the metric 
scaliTig problem of finding the point configuration that ^ost accurately reproduces 
a given set of interpoint distances. He has generously offered to include it hel'e 
(Figure 2). Whenever the researcher is confide^ht of^the metric properties of his* 
similarities data, the procedure described below is a far simpler and more accurate 
one for recovering the scale. However, when one is confident only oT the ordinal 
relationships in the origii<Jl data, non-metric scali;\g is more appropriate. In both 
cases, however, the procedure for finding point locations from distances is identi- 
cal-hence the impoPtance.Sf this example. In non-metric scaling, the distances 
*Titted" are the pionotonically transformed distances rather than the original dis- 
tances. In meti^G scaling, the original distances remain at all^times \hc target dis- 
tances. Convergence is thui assured. In the case of^non-metric scaling, the original 

^ . % - 14 ^ " 
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distances ^ogStrain at all times the target distances via the monotonic tran^orma- 
• tion (eithenrank-images or monotone regression values, see below), but the^ctual 
values of thl^r^t (Sstances change.as the iterations proceed. 

Tobler's trijiferation problem may be stated as follows: ^ 

i) gi^e§up to n(n-l)/2 empirical distances D.^ between n points; 

ii) find the x,y coordinates of these points jn such a manner that the dis- 
tances d^^ calculated from these coordinates agree as nearly as possible . 
with the given distairces D.., Tljis means that S(D.^ - d?.)^ is4o be iViini- ' '* 
mized. 

An^iterative'graphical approximation involves the following: 

Step I: Locate the n points arbitrarily. * 
Step 11; Draw straight fines through each pair of points. ^ * 
Step HI: On each line, center a segment of the dcsified length. Omit this 

step if an observation is missing. 
Step IV: Draw vectors from eachf point ^o the ends of the ^egmients 
representing fhe desired distances. . . j ' 

* Step V: Move ead!f point to'theTriew position defined by the average of 
I the local vectors, ^ ' ♦ 

StepM: If no points have moved in Step V, stop; otherwise, use the 
' ^ new positions'to begin again at Step II, 
A computational algorithm can be devised by exarnining the relations in the vicinity 
of one.pbint in more detail (Figure 3). For each nQn^e|4 desired distance: 

/I) . compute ds,j=(qij^-^ 



(2)... compute t^e directiop cosine of the line'i 

, ' \ cose..=Xi- >Xj , . 

(5) compute th^e' change in the x direcjion at' the i*^ point wiih respect to the j"* 
*poinf from erementary trigiOnometry ast /v . m> 

. ^ • / . ^ -T' 

, dx = cos<? ds =^i2 ( xt:^>Xj) (p.. - d° ). 
Similarly, corripuO^ " . j ^> 

dy.. - sin 0. ds.^ = 1/2 (yr-yj) (i^j - d;; ). ' 4) ^; ■ 



(4) The total $|iange in the x direction is the average of all of the partial changes: 
^ ic< ^ • ri-1 j=l 




^- 16* 
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Similarly,' 



75) Set 



dy.=_L2 , dy„ 
•'^ n-lj = l , 



X = xP+dx. 
1 I I 



y. = y? + dy^ 

d'j= [ (xj-x^ + (y'i-y>] 



* (6) Compare 



n - I 



' S Pij-dij)' 
j=l 



with theijdeiired accuracy level and 3top, or go to step (1) using the new values. 



h '^^i 




Figure 3. Trilaleration Example - Explanation of Terms (Provided by Professor W 
R.Tobler, Departmerit of Geography, University of Michigan) 
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8. Joint-space scaling solutions. The same principles we have already described can 
be applied to the problem of locating m individuals and n points in a space such 
that the order of the distances between the two sets of points corresponds with the 
original' Ordered data. A simple hypothetical Example will illustrate the model. The 
rankings of five locational stimuli by fou"? sample groups are shown in Table 2. 

TABLE 2. HYPOTHETICAL RANKING OF STIMULI 
BY FOUR SAMPLE GROUPS 



Stimuli 






Groups 




LocatioTial 










Types 


A ^ 


B * 


,C 


D 


1 


3 


3 


1 ' 


1 


2 


1 


2 


4 


5 


3 . 


2 


1 




3 


4 


4 


4 ° 


3 


2 


5 . 


5 


5 


5 


^4 



Geometrical Mojiel: 



Legend: 
*l Stimuli 
X Groups 



X. 



f A 



The distances from each of the groups to each o£ the five stimuli, when rank 
'ordered for each group, have the same order as in Table 1 . The geometrical model is 
useful because it summariz.es the data in Table 1 (n + m coordinates are sufficient 
to recover n»x m original data values), and because it also allows generalizations to 
he made about similarity between groups as they order the stimuli. For example, 
distance's between the 'groups in the model space may be used as a sunynary of 
similarity of point of view. • ^ 

D. Selected Approaches to Non-Metric Multidimensional Scaling Analysis 

There are at present a number of closely related approaches to the problem of 
multidimensional scaling (MDS), and some of these varieiJ approaches are reviewed 
below. , ' . 



1. Coombs. This MDS model is based, on the 1954 work of Hays '^nd is adapted 
from' the multidimensional unfolding of preferential choice data f7„pp. 444-462]. 
The basic requirement for this jnodel is that at least a partial ordering of the 
interpoint distances between n p6ints can be obtained. The intent is to put a frame 
of reference consisting of r axes on the space in \vhich the points are located, and to 
determine the rank order of the projection of all points on these lines. Eve|;j/ pair of 
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points in space defines a line arid each line is a potential axis of the space. The aim^ 
of the technique is to select a minimal subset of the lines and determine the order 
of projections on each in such a manner as to satisfy a given partial order of 
interpoint distances. The Lriietion for selecting lines.Js a least squares fit to the'^ 
, space. 

The fundamental feature of this approach is the finding of a partial ordering of 
interpc5int distances. For an> set of individuals, a complete (or simple) order of 
interpoint distances between objects is said to occur -when exactl> similar raDikings 
of the objects are obtained. Assume there are only two individuals (X,Y^', who 
perform a paired comparison experiment on five objects (A, B, C, E) and come 
up with the following orderings of data: ^ ' * 

.ForX:EBCi)A * ' / 

ForY:ApCBE . . . .\ . 

These rankings indicate complete order in the data. They can be plotted in a space 
between individuals X and Y such that the interpoint relations for each pair, of 
distances is satisfied.. Thus, X-E-B-C-D-A-Y. Here the pair EA is defuied as being 
most dissimilar b> both X and Y and all other distance relations (such as BC,"DA, 
etc.) can be maintained In this ordering. If however the order of data for X and Y 
' was as follows: 

For X: EBCDA - • . 

• For Y: AECDB . / " . . • , 

^ then complete ofder is not obtained and only partial order is derived. Here C,and 
D can be located between B and A for both individvials^ bu^ their exact order and 
the position of E is not easil> determined. For example, all we can deduce from this 
is that E falls between B and A for Y^ut not for X (Figure 4). ' . ' « 




Figure 4. Partial Order (After Coombs, 1964) 



The problem attacked by Coombs, then, is that of r^solying partial orders into 
simple order by searching for the minimal number of dimensions required to .locate 
these objects, ^yhen projected onto the selected dimensions, the distances between 
points located in the resulting configuration must reflect thgir position and rela- 
tions in the orderings given by individuals. * * \ 

The methodology involved is an iterative procedure basedjon least squares princi- 
ples. The first step is to check for unidimensionality by'selecting the largest inter- 
point distance and "seeing if a simple order of objects is obtained. If not, partial ^ 
ordenngs are examined to s^e how many objects can be located correctly with 

A O . / . l\ ^ ' > 



respect to this dimensioi^ and incomparable relations (such as the GD order in the 
example above") are resolved where possible. A second dimension is then chosen 
from the remaining incpmparables, and another attempt made to produce a simple 
order of these remainicjg distances. Success \vou!d 'indicate a two-dimens]on4solu^. 
tion to the original problem, failure involved selection of alHird and perhaps othe^r 
dimen^ons using the same principles of selection. Fnnal axes are generally '"reason- 
ably orthogonal" to each other. The end result is a configuration of points in an 
r-dimensional space^^ ' 

Z Torgerson. Torgerson's original methodology appeared to have much in common 
with factor analysis and, in fact, used a centroid method of fMCtor analysis to derive 
the factor scores which were used to define the final configuratidn.of object-points. 
Taking as input triadic comparisons data, torgerson converted initial similarity 
proportions into standard (Z) scores. These scores were then, transformed into a 
sin^e matrix of '"comparative distances" between objects and an attempt was made, 
^^t a find an "additive constant" which translated the comparative^Sistanoes to ab- 
solute distances. In other words, the absolute dist^ce between objects (d..) was 
regarded as being a combination of some perceived or psychological distance (h..) 
and the additive constant (C). ^ . . 

Torgerson {57l suggested that both the formal mathematical method of Messick 
and Abelsori and hi^ own shortcut method could be used to define the additive 
constant in ^ny given experimental situation. Once a matrix of absolute distances 
had been compiled, it was transformed again to a scalar product matrix-(B*) and 
referred to'an origin at the centroid uf all tlie stimuli. The B* matrix was. then 
factored' (using centroid procedures), the relevant factors examined, and factor 
" scores us^d as coordinates on each dimension in order to specify the nature of the 
final configuration of points. The essence of this approach, therefore, was to find 
the kind of distance furtction that was necess.ary to*' convert (psychological) 
measures (|f similarity inta real-number measures. 

The differences between this early approach and the one examinf^d in detail in 
this paper result from the adoption of a weaker assumption (the monotonicity 
assumption) in transforming psycholpgical to real distance measures. While some 
basic similarities are retained, the emphasis* in the TORSCA program used here js 
not one ot finding an additive, constant but rather one of developmg a set of 
monototiically related measures which cprresppnd to the onginal dissimilanty 
measures. Since our^ detailed example breaks down the TOJRSCA algorithm mto 
each of its component garts, no further expansion is necessary here. 

3. Shepari Unlike much of the earlier'^ work in MDS, Shepard {50] did not try to 
deal explicitly witi? a definition of the distance function required to translate 
*^psy chological" into "real" distances. His aim was to find a configuration of points 
in a minimum number of dimensions such that 4 plot of psychological distances 
against real distance would reveal the specific function needed f<Jr the translation. 
Jfc technique was to allocate a set of (n-1) vectors to each of n-stimulus points 




.directed to each of the other (n- 1 ) stimuli, and to indicate at each iteration how the 
stimuLiS point should be moved in order to improve the monotonicity requirernent. 
At each iteraUon there was simultaneous displacement of all these points. 
' In order t^^ obtain a minimum dimensional configuration, however, another set 
uf (n-l) vectbrs was defined for ever> (n) point which auned at increasing the 
variance uf distances at eachuteration. B> aiming at an increase in the Variance of 
distance it was possible lb collapse the configuration mto smaller spaces. Tp effect 
this increase in vanan^e, fefge distances were made larger, and small distances made 
smaller. The Iterative procedure was halted when an index (S) defined as.* 



s s 

_LiJ 



(S.'. -S(d..))2 / n(n-l)J'^' 



<8) 



where S' =^ proximity measure between Sj and S: 
and = number of stimuli. 



proximit> measure corresponding to the computed distance fd..), 



became *'small enough''. Since the stipiuli would be still defined in terms of co- 
ordinates in an (n-l) space, tn order to expose the minimum dimensional configura- 
tion he ha4(hke Torgerson) to determine a scalar product matrix, obtain a roots 
and vector solution to the matrLx, and eliminate "unimportant" axes. ^ , 

4. Kiuskal. Shepard's focus on monotone relationships rather than a specific mathe- 
matiLai function design.ed to translate ps>chological into real distances piovided ihe 
impetus for Kruskal's approach to multidimensional scaling [28, 2% 30] . His basic 
problem wa^ to find a set of distance measures which could be related to the 
dissimilaritv measures such that monotonicit> is not violated, and such that a 
monotone regression of dissimilaritv and distance >ields a minimal value for 

STRESS. - , 

After, generating an arbitrary initial configuration which is used to calculate 

distances* and to generate dispanties, Kruskat prodi^es convergence of the configu- 
, ration and onginal data b> a method knovyn as the method of steepest descen^t(or 
- method of gradients). Fn-st the gradient of raw^stress is determined for each point 
* m the configuration in the space of specified dimensionality,: 
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i - dij(t)- 



Xja (t) - X,a (t) 



(9) 



where g.^ is the gradient of stress for distance (i) al location (a). 
dij(t) is the estimate of distance at iteration (t), 
djj(t) is the dCtual distance measure at iteratipn (t), 
: and\.^ & x.^ are orthogonal coordinates of pcJints \ and j. 
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The gradient provides directional information for moving the points in the con- 
figuration along the path of steepest crescent such that S(t+ O < S(0 (S = Stress).Jn 
essence, the gradient is determined by taking the first partial derivative of stress 
with respect to each point (assuming the partials are negative) [29, p. 1 18] , 

' (10) 

Once the directions of movement for points in the configuration have been 
deterrtiiR^^d, the distance that each is moved (step size) is determined, using: ^ 

i) an estimate of the previous step^ize (a) - for the first movement a can be 
arbitrarily set (e.g., a = 0.2) [:?l,p. 121];- 

ii) an angle factor "(f,) which represents the cosine of the angle between 
successive gradients in successive dimensions*(defined as4.0 (Cos^)3 where 
0 is the arigle between successive gradients); 

iii) a relaxation factor (fj) which is defined as: 

• / f2 = ^ • ■ - 

-/ ^ : • 

J 1 + (5-step-ratio) ^ o 

where tlje fiye'-step-ratio = min \, f present stress , 

L ^ stress 5 iterations ago ^ j 

v) a *'good luck" factor (f^) defined as: 

fa = min fl. ( present stress \1 
^ . \ prions stress J J 

Note in ii) aboye, if g represents the present grad7?nt and g" the previous one/theji 
[29. p. 122 J.- . • - • 



■ cos e = i, s gi, ■ gj. 
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Thus step size atjteration (t) is defined as: { 

a(t)=a(t-l),f,,f2,f3. 

This provides for relatively large steps at the beginning of the iterative process and 
fairly small stepVtowards the end. Although the length of gradient is partly used as 
a source of information for termination of the iterative process, Kruskal relies 
primarily^ on the angle between successive gradients to guide the iterative process. 
The calculation of step size ih term^ of STRESS helps to partly overcome the 
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Ofblem of local minima, for if STRESS is large the iterative procedure will con- 
Jue, while for low STRESS the local minima may prove a "satisfactory" configur- 

he outcome of manipulating distance estimates is to obta^I^a.^^flguration of 
|lnts in a' space of specifie4 dimensionality in vvWch the distance r&ktions con- 
John to the original dissimilarity measures. Like factor analysis, the problertr«iay 
^filien arise as to how to interpret the dimensions of the configuration, but at least^ 
t<me is aware of the number of attributes being used by a population ia order to 
ce conniparative judgments concerning the original stimuli. 

uttman-Lingoes. These multidimensional scaling programs known as the Small- 
Space Analysis (SSA) series are developed from^he most cpmprehensive alge- 
sic treatment of multidimensional scaling to date [ 18; 35] . 
W The starting configuration is similar to that in TO^SC A described above except 
Snat it is^ the rank order of the input similarity measures, rather than the s.. values 
Jthemselves. that are operated on and the initial factoring of the rank order matrix is 
\ot an iterative procedure as in the TORSCA algorithm. The loss function (0) (cf. 
ruskal's ?tress), is defined as: . * 

n . * 



= 2 (d. -d.-)^ 
i^ 1 



(n=l/2n(n-l)) 



5lhd the hotn^ized phi: ; , - 
^ ' ' » n 

. xr^ ^ = 2 (d. - d*f. I z d.2 

i = 1 % • i = 1 
^herS d.* are the monotonically transformed values of d^kno 
Wimages*' of the d [18, p. ^79; 33^, p. 9] . These rank images are obta; 

ithe dj.and by placing theffi in the cells corresponding to the"cells r; 
sjjhe in|)ut similarity valubs (s..). Th^ts,'the smallest interpoint distaj 
t;^^-from* a given configuration becomes Jhe rank-image of the inte 
* ^*correspondii\g to the smallest Sjj value . . . and so'on.- 
J:.Jhe dj* serve two purposes: first, they provide^ target estimates t; 
, -""^^nk-Order corresponds to that o if the input datj,*niay be used^to gt 
. ' ^;Jter"ations toward a solution configuration; aiid second, they provide 

t ' i ' ' ' ' * ^ 

be used to measure progress in teaching a final solution. Thus, wh 



7 0, perfect fit obtains. The measure of unexplained variance (the 
alienation'*) is defined: ^ . ' ' 

I • ^ K=ri-(i-0)^]'/3. - 

A fhe method of minimizing K is known as the "Correction Matrix" me 
«7 similar to that used in the non-metric algorithm in the TORSCA routine (d 
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in detail t|\^e worked example in Section 11), with modifications in the step»size 
a4justment parameter (a). In the SSA series, alpha varies not only from one itera- 
tion to another (as ii^both the MDSCAL and TORSCA algorithms) but also from 
one pomt to another:* A further refinement in the SSA series is that iterations are 
allowed to proceed for a.given set of d.* before the re-definition of a new set of d * 
on the bads of newVomputed values of d , (known as the two-pliase process). Some 
advantages, are derived by formulating the algoijthm in these terms [18 pp. 
485486]. ' * . . 



E. Selected Problems in Non-Metric Scaling 

i> ' . ^ - ^ 

We cliscuss below two major problem areas which must be confronted in using 
multidimensional scaling. One is an, interpretive problem-that of finding substan- 
tive meaning in the dimensions- of t^e recovered configurations. The other is a 
problem in algorithm construction and is concerned with ensuring that whatever 
goodness-of.fit function the algorithm has attempted to minimize has in fact been 
minimized. Some of the earlier scaling programs were particularly prone, under 
certain conditions, to lead to "solutions'' that were far (rom the qptimal possible. 
This problem of local minima and convergence is discussed first. ^ 

« , i. • 

^ K Local minima and the convergence process. A goal of all of the scaling algorfthms 
discussed was to avoid situations in which po^s would becom^ located'such 'that 

^mall movements, fifo\yever computed, would always lead A higher stress even 
though some other configuration of the points might exisj/for which stress' would 
be appreciably lower. A number of strategies have been s^gested to ensure that the 
researcher will not present such interim solutions as finffl ones. 

^. Multiple solutions. In this strategy [25, p?^8r 33, p. 117] the researcher 
computes a number of solutioris from differentMially random) starting configura 
tions. Thus the hope is that local* en traprnent 
some of the "runs.'' With this strate|y, the 
accepted [20] . 



lints will be avoided in at least 
ist-fit ting' solution configuratio^mis 



Selection ofh^^^Bf/^frafton. Eviden, 
[^^] tha^^M^pRnigurations influen 
jninirn^Hp^^^R^ndom or arbitrary coi 
thij,(pfflKejy[33', p. 126] , Sometimes, the re! 
S(:^0S me properties of the final con figurati^ 
^a4onfiguration\coritaining these prope. 
goes and Roskam* [33, p! 1*27] recommend a 
pmpute a starting configuration for the non-metri 

t 

fc. Backup procedures ^ to confirm convergence. In ht? 
MDSCAL algorithm [28], the^ technique employed to mi tig 




pions of Kruskflfs 
iQ the occurrence of 
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local minima problems is to retam after ai!^ iteration the capability of returning to 
the configuration pnor to that iteration if either stress has increase^J by 20% or , 
more, or if the angle of the gradient after the iteration (computed to determine the 
direcUon of movemlint for ain pumt) is close to being directl^ oj^po^ite (180°) to ; 
its value pnor to the iteration. • • » * 

J. Varying the target distances. The distanLes that the configuration points are 
attempting to match are of crucial significance to the local minima problem. While J 
the dj^ of Kruskal ha\e the advantages of speedily converging (when<;us&d with a^ 
sensitive step-size adjustment procedure), they have the disadvantage of changing in 
value between iteration^. With both configuration distances andMafget distances 
changing, smooth convergence is not guaranteed, indeed, erratic behavior some- 
times occurs. Lmgpes and* Roskam [33, pp. 133-136) have experimented with an 
algonthm in which Jhe target distances are, alternatively, the d* (rank-image dis- - 
tances-used tO nlinimize the Guttman-Lingoes coefficient of alienation) and the d 
(moftotone-regression distances or disji^arities) used to minimize stress. Thjs 
sequence of targets applied to experimental data in a succession of analyses showed 
very low incidences of local minifha. Thay conclude that, although this strategy 
involves more iteration! than one which stayed with the one set of target distances^ 
It is sufficiently impressive m avoiding local minima problem^ that it is incorporated 
in their revised algorithm MINISSA. 

2. Interpretations of scale dimensions. One of the fundamental problems in the use^ ^ 
of MDS IS that of interpreting the dimensions in which configuration^ are mapped. 
In some cases this particular problem does not arise because* the researcher* is 
interested only m the position of relation of the configuratiX)n points with respect 
to each other (for example, in determining clusters of stimulus points). HoWeVer, in 
other cases the scale value derived from projecting any ^ven pcHnt onto ati appro- 
priate dimension of the space is sought after. Under these'circumsJances the inter-' 
prctability problem arises. ^ . • * 

The problem of identif>^r)g the dimensions of any configuration iias been 
esolved m a number of different ways. Tor many of the geogrjfphical studies 
er/tioned in this paper, some information is known about the location^of stimulus 
an objective space prior to the building of a configurationrWlien this type 
lation IS known, constraints c,in be ^placed, on the number df.dimensions in 
whic!^3Woutput cun%uration is produced such that one attempts to replicate the 
spatial IS^^re of the original" objective configuraSion. Thus in Tobler's use of 
MDS algonl^^Ssas m.ap transformations, he is able t j) orient configuratipns^.ffl the 
same way as pli^elected map "projections are oriented, and to ijjJterpret distances 
accurcjing to disSoces measured on these projections. Sinlilaily, in the urban 
distance perception sH^dies c^scussedjater, the locations of the phenqmena being 
investigated were knovvn^ior to the beginning of the Study. Output ccTn figurations 
could therefore be rotated, reflected', artd translated until the positional relations in 
the output configuration have the san\e directional components as In the original 
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data, both of these cases, the problem of identifying the dimensions themselves 
are trivial ones. . . » 

In cases where the configuration of stimulus points is not known, considerably 
ingenuity has been used in order to interpret dimensions. For example, m one study 
(III-A-2) 'a» large number of personal, social, economic/ and attitu4inal 
characteristics' were collected for each\ample respondent, and various types of 
characteristics were collected for the stimuftirpoints. By investigating the scores of 
stimulus points and individuals on unidimensional scales of each attribute, the 
autfibr was abk to interpret his dimensions by choosing those attributes which 
appeared to be most highly correlated with the derived scale values. In {bother 
study (III-A-3), dimensions of the configuration Were interpreted ip terms -of 
qualities of the stimulus objects that had been derived from independent scaling 
arialysis. Thus,^hen the. configuration of^stimulus points is^unkuown, it appears 
that the most appropriate method for identifying dimensipns Is to compare the 
scores of each stirftulus point on each dimension with some prior selection of 
attributes of the stimuli. Those attributes having the highest correlation with scale 
values then lend themselves to use in interpreting dimensions of the configuration. 

Another point which is -seldom discussed in the literature is that of the 
positioning of the axis of t^e configuration space. It appears that, for the most part, 
sc*alr^^i4ues are referred to the centroid .of various configurations. 'Jhis means in 
effect thX^n arbitrary zero point is established, and the data can be interpre'ted at 
no Higher l>yd than an interval scalg. In terrns of the problems associated with 
interpreting 'configurations, therefore, there are certain similarities with other 
multivariate HechrHgues such as principal components and factor analysis. We may 
conclude that th^te are corfespjon ding difficulties in ^ the interpretation of 
dimensions just as ther^si^ in these other metric methods of multivariate analysis. 



ERIC 



« 26 



00037 



IL THE NON-METRIC ALGORITHM: A WORKED EXAMPLE 



Our disciission to this point has cei^tered on the objectives behind multidinfien-^ 
sjonal scahng approaches and on the question of the appropriate criteria for evaluat- ^ 
ing goodness-of-fit- Several algorithms have been developed to meet the stated 
objectives, and onl^ recently have any systematic attempts been made to compare 
the relative performances ^f the different algorithms [33, 43, 53, 62]. It is our 
purpose here to explain multidimensional scaling through the mechanics of a hand- 
worked example, and we choose in the example below to solve a problem using one 
of the^ several scaling algorithms. o 

The particular one chosen was -based on the availability, at the time of writing, . 
^f a clear description of the algorithm. The one used is that developed by Young 
and Torgerson [61, 63], and it is applied below to a very simple, hypothetical 
problem. » In the following section of this paper we will discuss more realistic 
examples. Ho\yever, all such applications of multidimensional scaling techniques to 
date have been with data containing too maay points for solving in a hand-worked 
^example in a reasonable length of time. Even the simple hypothetical problems 
described below would take a day or so to solve by hand-calculator^methods. 

A. The First Example ^ ; 

The problem wp outline il^^one iiKwhich the solution is known^at tlie outset, but 
the worked ,exam{^\e below is suclKhat the information derived as a s)^lution is not 
explicitiy contained iti any of the initial data from which it procee(^ The multi- 
dimensional scali|)g "problem recall is essentially one of finding thqf.locations of 
points in a^pace oj any given number of dimensions such that an ordering of the 
distances bet\yeen pprnts in this space best corresponds to an ordering of the points 
in the original input data. Thus, it is implied that the researcher possesses knowl- 
edge on th^ order relations between a set of points (usually from experimental 
data) .and that he hypothesizes these order relations are derived from a mental 
coi-'^'Turation of the points (unknown, of course, to the researcher). The purpose of 
the scaling, then, is to construct a configuration of points from which measure- 
ments can be made between points on which the order relation c<^rresponds to the 
order relation of the int^rpoint distances in the experimental data. - , 

In Figure 5 we portray an arbitral configuration of four^oints in two dimen^' 
sions, in Table 4Mhe distances between the points are shown; and in Table 5 the 
.order relation of these distances is shown. In the w6rked example, ^e information 
of Table 5 is the oply part of this initial data that is used. HoWever, since it is 
derived from Table 3 which in turn is derived from Figure 5, it is possible t6 check 
the accuracy of the solution developed below by ^Avaring it with the arbitrary 
configuration in Figure 5- In bri&f, thereforg, the pWiem can be described as that 
of deriving the essential^properties of tliis figure solely from fhe information con- 
tained in Table 5. 
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JLE 3. COORDINATES.FOR THE ARBITRARY CONFIGURATION 
IN FIGURE 5 





Axis a 


Axisb 


Point 1 


1.0 ' 


1.7 


Point 2 1^ 




1.2 


Point 3 




•2,0 


Point 4 


2.15 


0.4 



TABLE 4. INTERPOINT DISTANCES IN THE ^ 
ARBURARY CONFIGURATION 





1 


2 


3 


4 


1 


0.0 , 


0.5 


1.09 


1.75 


2 


0.5 


0.0 


1.25 


1.30 


.3 


1.09 


1.25 


0,0 


1.60 


4 


1.75 ^ 


1.30 


1.60 


0.0 


TABLE 5, 


ORDER-RELATION OF DISTANCES 


IN TABLE 4 




1 


2 


3. \ 




1 


0,0 


1.00 


. 2.00 \ 


6.00 


^ 2 


' 1.00 


0.0 


, 3.00 




* 3 


2.00 


3,00 


0.0 


5.00 


- 4 


6.00 


. 4.00 


5.00 


0.0, 



1. OveralI^|y|jtegy. The. problem is solved in a ser^ ofjnterations, each one of 
NVhich compnses four stages. We begin w th a rando^' configuration of the four^ 
points in twO'dimensions*'(the initial configuration). ' 

The strategy for solution is to move successively closer to th^ solution jon each 
iteration, sti>pping when the index of fii shows that the pi'evious iteration has 
resulted in a*Yiew conrfigurStion that is not superior to that o"f the previous one. 

2. The four stages. Step \ , Computafiori of distances: Distances between the points 
of any configuration are computed frWn the formula: 

■t 

2 ( 



- W 



r 



l/m 



(14) 



The worked example does not exactly pajaWel the method suggested by Young and Torger- 
son [63] since, in the interest of reducing theYat^l number of iterations required to reacKa 
solution, they suggest a strategy of first preparing an initial configuration by assjgjiing metric 
properties m the input data and then optimizmg the configuration by the method outlined 
below. In the first section of this example we confine ourselves to a discussion of the hon- 
metric algorithm since this is the essential step common to all ipultldimensional scaling tech- 
niques The computation of initial configurations with properties approaching final configura- 
tions is discussed later in this section, ' , i 
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'^'^ --j^ere i and j represent any two points in the configuration, x.^ and x. are Ihe 
^ coordfa^tes of the points on axis a, r is the numb^ 9^ axes (dimensions) for which 
^ "'"^ the solu^on is being computed, and m is the Minkowski constant^ [52] determining 
the type of distance metric for which Jthe solution is being determined,^g., 
^Eu^idean distance has the M^kowski constant of 2. . * 

Step n. Computation of monotonically transformed distances (disparities): The 
purpose of this^step to^onstrain the distances from the given configuratiiSi so 
that they do not violate the o^^r rel^tFoli of the originai iiiterpQint tlistances (in 
this case. Table 5). THiis is achieved by efisurin^ that if one were to plot the varies 
* * of this table (y axis) against the monotonically transformed values of Table l\x. 
axis)^Oie "curve" joining these points would never move to' the left but only 
vertically or to the right. These monotoificallyk trar^sformed distances ^re known as 
"disparitiesi'; they are not distances from anyiknown configuration but rather are a 
monotone sequence of numbers as "nearly equal" ^to the distances in the given 
i^Jifiguration *as is possible without violating the original order relation of 
interpoint distances. . It' is reiterated that the purpose of non-metric^ 
^j/ (ilultidimensional scaling i^ to c6nslruct a configuration of points in anyt given 
n%iber of dimensions such that thfe interpoint disSnces and the monqtonically * 
C. "constrained distances (disparities) are as similar as is poBibl6 [2S, 29, 61] . 

TABLE 6. COORDINATES FOR THE INITIAL (RANDOM) CONpfCURATION 

IN FIGUJfE 5' ' 

' Axis a / Axis b " , 

- — ^ Poin^t 1 , 4.70 /O.IO 

Point 2 ■ 3.60 "3.90 

Points X- 1.30 3.60 . 

\ * Point' 4 W . 1.70 ^^^.J.OO 

• m t^^'' * ' 

TABLE 7. INTERPOINT DISTANCES IN THE INITIAL CONFIGURATION < ' 

. . f- . ^ . > " ^ 

_ ^ 1 2 3^4 

• 1 ^ 0.0 3.95 ^ 4.88 3.14 

2 ' 3.95 KO" 2.31 3.47 . 

> ^ 3 /'4.88^^ 2.31 0.0 ^2.63 

^ 4 "3.14 ' 3,47 2t(S3*/\„ *0:0 - 



< 



f 



^ ' The disjyaritiiBS are pomputed by taking the -interpoint distance from the 
V .configuration (Table 71 con-esponding to the two tnost similar piints (smallest 
distance) in the original^^'data (Table 5) and comparing It with the distance 'that 
conesponds with the n^xt two most similar points. If the first distance is smaller 
than the second, then the order of the distances con-esponds with the order of the 
orignal similarities and na transformation is necessary. However, if the first 
distance is larger than* the second, then the original order jelation of interpoifit 
distances is violated; since the disparities must not decrease in value when the 
similarities increase, the arithmetic mean of the two distances is substituted for the 
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distances and this mean becomes *the first two disparity values. The interpoint . 
distance from the configuration corresponding to the third smallest original sinTi- 
larity (Table 5) is then compared.with the second disparity (which^might be the 
second distance' or the mean of the first two distances--as discussed above). Ifjhe 
third distance is larger than the second disparity, it becomes the third disparity; 
otherwise, the mean of it and the previous disparity (weighted mean if the previous 
disparity was composed of more than one distance) is computed, and this mean 
becomes the disparity value for the third distance and second' distance (also for tjie 
first distance if the second disparity was itself a mean value): In the first iteration of 
the problem described below, the disparities will be computed step by step follow- 
ing the pfocedure outlined^above. * J 
' , Step III. Computation of goodnessof-fit (Stress): The measure of go6dness-of-fit 
is a measure of, how far the disparities (d^.) depart from the^distances meastired 
from the derived configuratiori^(d..). The larger these departures arenas compared 
with the distances themselves,* the greater the error in reproducing the, order rela- , 
tion 0^ the oiriginal similarities, from the derived configuration and therefore the* 
poorer the fit. Kjuskal's "stress" values as defined in equations (4) and (5) are both- 
com^ted in the example outlined here. The smaller the stress, the better the fi\. 

Step IV. Computation of a dew (improved) configuration: As noted, the greater 
the discrepancy between the distances and disparities from any configuration, the 
poorer the configuration. Therefore, to improve any given configuration each point 
should be moved so as to reduce the average discrepancy between the distances and 
the disparities with respect to the other points [29,' pp. 1 17- 123; 61, pp. 6-7] .,If 

^ U point i should be moved closer to point j by an amount proportional 
to the size of the discrepancy. Thus, ceteris ^paribus, after this adjustment the 
discrepancy djj - d,j should ^e smaller on the new configuration than on the 
previous one. However, since for each of n points there are n - 1 distances to the 
other points, there will* be n — 1 possible adjustments for each point. TJie mean of 
these possible, adjustments is the actual adjustment that is made. In the worked 
example that follows, the formulae used are from Young [61 „ p. 6] . The displace- 
rrtentof poinfj with respect to point} is given by: '*i 

\ = rK-^ji^ 05)- 

where dj^^re distanjies computed from the previous configuration, 

d,Y are the disparities (moftotonically transformed distances) from the pre- 
vious configuration, ' ^ - , • 

Xja and are Uie coordinates of {on^ i and j respectivel]^on axis a, 
and a is a constant proportionality. 

The newposition of point i on, axis a is tlie coordinate on the previous configura- 
tion plusnhe me^ns of the correction vectors defined in (l5) above: 

» • k; j = 1 ■ v. 

<i> ■ ^ " ' ' ' %31 ' * 



x' 

la 



X., + C. 
la ta 



(17) 



Alternative formulae for reaching the same goal are discussed elsewhere, [29, pp. 
* n 7- 1 26 ; 1 8, pp. 484-500; 4>, pp. 23-29) . 

In the literature of multidimensional scaling, alpha is con\nionl> referred to as 
the "step-sizef parameter." Much discussion has centered on the question of modify- 
ing the value X)f alpha as the iterations converge on a solution [18, pp. 491 -492, 
29, pp. 121-123; 43, pp. 23-29] . th€ method of computing revised configura- 
tions us^d in this example, larger values of alpha would speed up cpnvergence (that 
is,' lead to a given solution* in a small numbe/ of iterations) but subject to the^ 
constraint, that if alpha becon^es too large, there is the danger of moving the points 
in-any revi^^dcpiifiguration too far. In this example, for the" sake of simplicity^ 
^Iphaiagiventnevalue of J.6.' , ^ 



3. Application of the four stag^ to the sample problem. Step I. Compute the 
distances: Applying formula (14) to the coordinates of the initial configuration 
(Table 6) yields the information in Table 7: % 



e.g., d 



23-((-2a-S;)'-^(-2b-S^)'V'' 

' = [(3.60- 1. 30)^ + (3.90- 3:60)2] 
= 2.31 



Step II, Compute the disparities: Following the ordfr relations of the distances 
^ in the original arbitrary configuration (Table 5), the distances in Table 7 must be 
transformed so that their order does not violate the order of Table 5. This is 
achieved in a series* of trials as set out in Table 8 below. In Trial 1 the distances in 
. the initial configuration corresponding to the first two distances in the original data 
. are seen to satisfy the constraint of ascending order. However, the third distance 
, (2.3 1 in Table 7) violates this order, and therefore in Trial 2 it is combined with the 
previous distance (4.88) to produce a Tpean of 3.60. Since this new number is 
smaller than 3.^5 (the first distance) it must be combined to produce a new dis- 
parity of 3.71 (Trial 3). In*such a manner, the final s^t of. disparities are computed 
in Trial 6 where all have the value of 3.39. 

Step III. Compute stress: Applying formula (4) to the distance-yalues in Table 7 
and the disparity values on the final trial in Table 8 yields: 
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4.347\"2. ' '_ 

—. ) = V .05908 = 0.234; (S= 1.0). 

.73.571/ 



T.ABLE 8 COMPUTATION OF DISP.ARITIES IN THE 
INITIAL CONFIGURATION 







Corresponding 








Interpoint 


distances in 








order in 


initial 




Dispanties(djj) 




original data configuration 




Rank 


(Table 5) 




Tnal 1 Tria^2 


.Trial 3 Trial 4 Trial 5 


1 


L 2 


3.95 


3.95 


3.71 3.65 3.45 


2 


l»3 


4.88 


4.88 ~ 3.60* 


3.71 3.65 3.45 


- 3 


2.3 


2:31 ^ 


2.31* 3;60*' 


3.71 3.65 3.45 


4 


2.4 


- 3.4-7 




3.47* 3.65 3.45 


5 


•3.4 - 


2.63 




2.63* 3.45 


6 


1,4 


.3.14 




3.14* 



Trial 6 

3.39 
3.39 
3.39 
3.39 
3.3^ 
3.39 



•indicates rank-order violated 



Step IV. Compute a nr^ configuration. Appl>ing formulae (15) and (16) and 
substituting the distances fromjable 7 and the dispanties from Table 8, the adjust- 
ments to the coordinates on both axes of the initial configuration arexompuSd 
below. - " . ^ 

ITERATION! 

Point 1 on Axis 1 

• - 

(3.95 - 3.39) xl(3.60 - 4.70)/3.95 = -0.16-from formula (16) - - 
(4.88 - 3.39) X C1.30 - 4.70)/4.88 = -1.04 
*(3.14" 3.39)X (1 70 -4.70)/3.14= 0.24 • 

• ^P?24 1^ meanmove (alpha =A.OO) — from formula (17)* 

Point 2 on Axis 1 

-^^3.95 - 3.39) X (4.70 - 3.60)/3.95 = 0.16 
(2.31 - 3.39) X (1.30 - 3.60)/2.31 = 1.08 
• (3.47 - 3.39) X (1.70 - 3.60)/3.47 = -0.04 



V /(al 



0.30 is mean movJ (alpha = 1 .00) from formula ( 1 7) 

Point 3 on Axis 1 , - ' 

• • (4.88 - 3.39) X (4,70 - 1.30)/4.88 = 1.04^ 
' (2.31 - 3.39) X (3.60 - 1.30)/2.31 = -1.0^ 
%' (2,63 - 3.39) X (1.70 - 1.30)/2:63 = -O.jJ^ 



-0.04 is mean move (alpha = 1.00) 
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Point 4 on Axis i . 

(3.14 - 3.39) X (4.70 - 1.70)/3.14= -0.24 
(3.47 ^ 3.39) X (3.60 - 1.70)/3.47 = 0.04 
(2.63 - 3.39) X U .30 - 1 10)12.63 = 0. 1 2 

-0.02 is mean move (alpha =,1 .00) 



Point 1 on Axis 2 

(3.95 - 3.39) X (3.90 - 0.!0)/3.95 = 0.54 
(4.88- 3.39) )$73.60- 0.10)/4.88= 1.07 
(114- 3.39) X (1.00 -0.10)/3.M = -0.07 

0.38 is jnean move (alpha = 1 .00) ^ 

Point 2 on Axis 2 . 

(3.95 - ^.39) X (0.10 - 3.90)/3.95 = -0.54 
(2.31 - 3.39) X (3.60 - 3.90)/2.31 = 0.14 
(3.4'? - 3.39) X (1.00 - 3.90)/3.47 = -aO? " 

-0.1 2 is mean move (alpha = 1 .00) ^ 



Point 3 on Axis 2 

(.4.88 - 3.39) X (0.10 - 3.60)/4.88 = -1.07 
' (2.31 - 3.39) X (3.90 - 3^60)/2.31 = -0.14 
(2.63 - 3.39)^ (KOO - 3.60)/2.63 = 0.75 



-O.I 1 is mean move (alpha = 1.00) 

Point 4 on ^xis 2 

(3,M 1 3.39),x (0.10 - 1.00)/3.14 = 0.07 
(3.47 - 3.39^ X (3.90 - 1.00)/3.47 = 0.07 
(2.63 - 3.39) X (3.60 - 1.00)/2.63 = 0.75 

—0.1 5 is mean move ( alpha = 1 .00) 

. ^ *♦ 

The new coordinates are computed by adding the mean move of any point on 
any axis to the prior position of that point on that axis. Thus, from Table 6 and the 
computations above a new set of coordinates is calculated: " , 

/' • Poiftt 1 on ;\xis 1 : 4.70 + -.24 = 4.461 . ■ 

* Point i on Axis.-2: 0.10 + 0.38= 0.484 . 

* Point Zon Axis \ : 3.60 -^ 0.30 = 3.897 

• , Point 2.on.Axis 1: 3.90 +^-.12 = 3.784 ^ 
•••^ * ' 'l-<'^' . . ' ' 

. . Point 4 on Axis'?r-ll00+ -.1^ = 0.847 * 

"Missing information ruay be calculated from information previously given. 
• S 111- 34 

ERIC 



^. i f 



TABLE 9. CONFIGURATIO*N 



lie 

4 • 



' 1 

4.461 

3.897 

1.262 

1.680 



2 

0.484 
3.784 
3.486 
0.847 



Step V. Compute the new Sistances (Le., repeat Step I). The* new interpoint 
distances can be calculated from formula (14) and from the cpordinates, given 
above: ' * ' ■ 

e.g., dj 2 = [(4.461-:3.897)2 + (0.484-3.784)^] ^ 



= V 11.2081 
« 

= 3348 



Remainmg distances can be calculated from data in Table 9^ above. They will hav^ 
the valueslirTable 10, Below: 



TABLE 10. DISTANCES 



1 

3-i 



1 ■ 
0.0. 
3.348 
4.387 
2.805 



2 

3.348. 
0.0 
2.652 
3.680 



3 

4.38J 
2.652 
0.0 
2.672 



4 

2.805. 
3.680 
2.672 
0.0 



Step VI. Compute the new disparities (Le,, repeat Step JI): These new values are 
given in Table 11. ' ^ 



Order of inter- 



m 



Distances in 
V * the new 



Disparities 



^riginal data^ 


configuration 


Trial 1 


^Triar2 


Trial 3 


Trial 4 


Trial 5 


1:2 


3.348 


3.34? 


3.348 


3.348 


3.348 . 


3.257 


• K3 


4.387 


4.387 


3.520 


3.520 


3:348 ■ 


-3.257 


\ 2.3 


. 2.652 


2.652* 


,3.520 


3.520 


3.348 


3.257 


2.4 


■ 3.680 




« 3.680 


.3.176* 


3.348 ■ 


^.257 


3.4 


2.672: - 




2.672* 


3.176* 


3.348 


3.257 




2.805 








, 3.348 


3.257 


1? ' . 


V 








' 2.805* 


3.257 



*order relation violated 



TABLE 11. DISPARITIES 




3- 
4 



I 


2 


0.0 


3.257 


*3.257 


0.9 


3.257 


3.257 


3.257 


3.257 




35 



3 ^ 
3.257* 
3.257' 
0.0 
3.257 



4' 
3.257 
3.257 
3.257 
0.0 . 
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Step' VII Compute the new stress (Le.. repeat Step III). Applying formula (4) to 
the distance values of Table 10 and the disparity values of Table 1 1 yields. 

Sj = ^1387 = 0.1897: S = WOO 

y 66.03 

(Compare with previous stress value of 0.243 in Ste^ III above.) 



The second iteration and* those that follow begin with Step IV and repeat 
Steps V; VI, and VII. Results of these steps are given below. All necessary 
information for computing the results below is given in the description of the 
first iteration. ^ 

f 



ITERATION 2 



y. Compute a new configuration: 



Point 1 on Axis I 

(3.35 - 3.26) X (3.90 - 4.46)/3.35 = -0.02 
- -(4.39 - 3.26) X ( 1 .26 - 4.46)/4.39 = -0.82 

(2.80 - 3.26) X ( 1 .68 - 4.46)/2.80 = 0.45 

— . 1 0 is mean move (alpha = j .00) 

, Point 2 on Axis I ' * * 

(3.35 - 3.26) X (4.-^6^- 3.90)/3.35 = 0:02 
(2.65 - 3.26).X (1.26 - 3;90)/2.65 ^ 0.60 
(3.68 - 3.26XX (^l.68^-^^.90)/3.68 = -0.25 • 

D.09 is mean niove (al^ha = 1 .00) 

Point 3 on Axis 1 * . 

(4.39 - 3.26) X (4.46 - l.r26)/4.39 = 0.82 
(2.65 - 3.26) X (3.90 - l.26)/2.65 = -0.60 
^ (2.67 - 3.26) X fl.68 - 1.26)/2.67 = -0.09 

0.03 is mean move (alpha = 1.00) 

Point 4 on Axis<*l ^ * . ' * 

(2.80^ 3.26) X (4.46 - 1.68)/2.80 = -0.45 
(3.68 - 3.26) X (3.90 - l.68)/3.68 = 0.25^ 
"(2.67- 3.26) y (1.26 - i.68)/2.67= t).09 

-0.03 is mean move (alpha = 1 .00) 

Point 1 on Axis 2 

(3.35 - 3.26) X (3.78 - 0.4g)/3?35 = '0.09 
* (4.39 - 3.26) X (3.49 ■^QAS)I439 = 0.77 
(2.|0 ^. 3.26) X (0.85 - 0.48)/2.80 = -0-06 

0.20 is mean moVe (alpha = 1 .00^* * 
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ERIC- 



Point 2 on Axis 2 
- (3.35 - 3:26) X fa48 - 3.78)/3.35 = -0.09 
: (2.65 - 3.26) X (3.49 - 3.78)/2.65 0.07 
(3.68 7 3.26) X (0.85 - 3.78)/3.68 = -0.34 

-0.09 IS mean move (alpha = 1 .00) 

Point 3 on Axis 2 

(4:69 ~ 3.26) X (0.48 - 3.43)/4.3S T.-0.77 
(2.65— 3.26) X*(3.78 - 3.49)/2.65 -^^0.07 
(^.67 3.26) X (0.85 - 3.49)/2.67 0.58 



-0.07 is mean move (alpha = 1 .i 



00) 



Point 4 on Axis. 2 ' t 
(2.80 - 3.26rx (0.48 - 0.85)/2.80 = O.Otf 
(3.68 - 3.26) X (3.78 - 0.85)/3.68 = 0.^4 
(2.67 - 3.26) X (3.49 - 0.85)/2.67t -0.58 

-0.05 is mean move (alpha = 1 .00) 



^ ' ' , CONHGURATION 

y 1 2 / 

1 4.364 0.685 

2 3.987 ' 3.694 
— 3 ' 1.295 ■ '3.420 

'V ^ 4 1.655 0.801 , 

'Compute the new distances from above configurations:' 



DISTANCES 







2 


3 


-0 


1 




3.033 


• 4.111 


- 2.712 


*2 


3.033 


0.0 


2.707 


3.716 


3 


4.111 


' 2.707^ 


0.0 


2.643 


4 


2;? 12 


^716 


2.643 


0.0 



Compute the new disparities 
^ DISPARITIES 



1 2 ■ l3 . "4 

1 0.0 p.033 3\1^ 3.178 

2 3.033 0.0 3.178 3.178 

3 3.178 3.178 0.0 ' . 3.178 

4 -3.178 ^.178 3.'178 O.CL * 
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IV. Compure the new stress: 

> ' STRESS 

/,i:89 . 
5, =V =0.1750; 



S, =V =0.1 750; (82 = 0.9954) 

'61.51 

(Compare with previous stress value of 0. 1 897.) 



ITERATIONS 
I. THE NEW CONHGURATION 



.1 

2 
4 



^ 1 
0.0 
2.781 
3.91^ 
2.666 



' 1 ^ 
4.3d6 
4.02a 
1.333 
1.641 



II^JISTANCES^ 



' 2 

2.781 

0.0 

2.695 

3.692 



2 

.0.835 
3.60 k? 
3.385 
0.779 



3 

3.916 
, 2.695 
.0.0 
^2;625 



4 , 
2.666 
-^11.692 
.2.625 
"0:0 



in . DISPARITIES ^ ^ 



1 

2 

j4 



1 

0.0 , 

2.'781' 
3.119 
3.119 



2 

2:781- 

p.o 

^3.119 
3.119 

IV. STRESS 



3 ^ 
3.119^ 
3.119 
0.0 • 
3.U9' 



* 57.96 



4 . 
3.119 
?3.119- 
3.119 
0.0 



0.-1658r(S-= 0.9714) 



Further iterations are possible (and desiraSle since they will result^ lower stress^, 
yalues; and all necessary information is given above for the reader to continue these C 
rteralions..As a che^k on computation the results of two iterations along the pathjp 
. to the final solution are given below. \ , . / ' \ f 
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i^RATIONll 
I. CONHGURATldN 



1 

2 
3 
-4 



II. 



1 

1.237 
^.213 
2.638 



2 

1.633 
2.867 
3.404 
0.696 




1 

0.0 
1.237 
2.883 
2.883 



III. DISPARITIES 

3 

?.883^,: 



2^ 
1.237 
0.0. V 
.2.883 
2.883 



2.883 
0.0 
2,883 



2.§a3^ 

2.883 ' 
2.883 
0.0 . 



IV. STRESS 



Sj =0.091^;(S2- 0.3731) 
ITERATION 20*. 



1 

2 

a. 



I. CONFIGURATION 
1 



*4.072 
4.039 
1.483 
1,706' 



' 2 
' 2.646 
, 2.449 
■ 3.455 
, 0.649 



II. DISTANCES 



.2 
3 

A. 



1 

2 
3 
4 



•1 '2 3 . 

0.0 .,0.404 -^2.9^ 

0.404 0.0 s2J47 

2.947 ' 2.747>^\' 0:0 

2.747 2.947 2.81S 



m. DISPARITIES 



1 

0.0 

.0.404 
041 
2.841 



2 

0.404 
0.0 
2.841 
2.841 ' 

^9 



3 

#2.841 
2.841 
0.0 . 
2.841 



1*7,00050 



■ 4 
,2.7^47 
2.947 
2.815 



.4 

'2.841.r, 
2.841 
2.841 , 
0.0 . 



IV. STRESS 

J S, =0.0317(82 = 0.0904) 

FINAL SOLUTION 



After 30 iterations, stress was reduced to a value of 0.008S, Other results were as 
shown below: * ^ 

CONFIGURATION 

. 1 2 

1 4.054 2.194 " 

2 .4.046 2.301 ^ - , 

3 1.488 3.465 
• 4 1.713 0.640 

DISTANCES 

1 2 3/4 

] ^ 0.0 0.108 2.864 2.810 

' 2 0.108 0.0 '* 2.810 2.864 

3 2.864 2.810 0.0 2.834 - 

4 2.810 , 2.864 2.834 ^ 0.0 • 

- . ^ DISPARITIES. 





^ 1 


2 


3^ 


4 


1 


0.0 


0.^08 


2.837 . 


2.837 


2 






2.837 


2.837 




^ 2.837 


2.837 


0.0 


. 2.S31 


4 


2.837 


2.837 


2.837 


0.0 






stress' 




* 






0.0085 ;(S2 = 


0.0216) 


> 



4. The problem of the initial . configuration.* A » recent ^improvement im 
multidimerisional scaling algorifhms has resulted from the search for a nieflhod pf 
deriving an initial configuration that wouJd'^lje closer to the final solution 
configuration than an.arbitfary or-raadonj'^one [6"2, pp. 1.8-20]. There are Two 
reasons 'to support suj:H ^ searc^i. First;.4t .*i? clear that fewer iterations of th^*, 
algori,thhi wilf 6e^eqilijed if the "initial cqtffiguration is clese to the solution con- 
figuration, thus, a CGrhsiderablq saving in computing tifrt^ fs achieved. SeconcLsince— 
the method^of computing the monotonically transformed distane^Slfdispd|rities) is a 
weak YnonoVonic transformation in that it often results in the tying of untied^ 
original similarities data, and since'the goodness-of-fit criterion is a measurement of 



the reduction m the squared differences between the configuration distances afid 
the disparities, degenerate "solutions" are possible in which stress is very Small but 
in which only the weak-monotonicity^ requirement h^s been met (43, p. 16] . (A 
strong monotoniL requirement is one in which the monotonically transformed 
values ifiaease when the original similarities increase and are tied when the simi- 
larities data are tied. This uritenon is met by Guttman's rank-image transformation 
with the addition of his strong-monotonic requirement [18, pp. 479-452].) The 
different methods of determining initial Lonfigurations m the vauous multidimen- 
sional Scaling apjwGaLhes will not be reviewed fiere. Discussions are available else- 
where [43, p. 6/57, pp. 254-258; 62, pp. 18-20]. However, the advantage of 
beginning 4he^r I es of iterations outlined m the First Example with an initial 
configuration cluser^^to the find] solution configuration can be illustrated by retum- 

^ ing to the First Example and recomputing the.solution adopting this approach. The 
first stage, in .this case, is tp 9pply the metric method of Torgerson [59, pp, 
254 258] to the initial similarities to compute the initial configuratJon*(^urg^6). 
This method consists of computing the two (since this is the proposed dimensional- 

Mty of the solution being sought) largest eigenvectors of a matrix of scalar products 
computed from the mitial similarities data. * 



Figure 6. The Initial ^Configuration Derived from the Metric Approach of /Tor- 
gerson on the Original Similariyes 

^ . ■ ' 41 , . j 
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ORIGINAL SIMILARITIES (see Table 5) 

12 3 4 

1 0.0 1.000 2.000 6.000 

2 1.000 0.0 3.000 4.000 

3 2.000^ . 3.000 • 0.0 5.000 

4 6.000 4.000 . 5.000 0.0 

^ I. CONFIGURATION ^ 

' ^ . 1 ' 2 

1 -0.589 ' -*0.127 

2 -0.073 . -0.i25 
V 3 -0.288 <^ . 0.425 
- 4 ^ ^ 0.951 0.025 

- II. DISTANCES . 









. 1 


2 


3 . 


4 






1 


0.0 


0.553 


0.629 


■ 'i:548 








0.553 


' 0.0 


0.780 


1.082 






3 . 


0.629 


0.780 


0.0. 


1.302 






4 ■ 


1.548 


1.082 


1.302 


0.0 *' 


t 


k 


4 


) 'lir. DISPARITIES 










1. 


2 a' 


3\ 


4 ^ 






1 #0.0 


0.533 


0.629^ 


. ■r.548' 






2 ^ 


^ 0.553 


0.0 


0.780 


1.082 






3 


0.1629 


. 0.780 


0.0 


1.302 






4 


. 1.548 


1 .082 


1.305 


. o:o' 










. IV: STRESS 







Sj =0.0. 



With the First Example, the initial cdnfi'guration using Torgerson's metric 
method yields a better splution tttm |he non-rnetric'' algorithm described earlier 
after 30 iterations. rfThis resuh supports the conclusion that a.better final solution 
will more likely be found if such a method is used to deriv-e an initial configuration. 
The First Example is artificial ii^that it was derived from metric data, and therefore 
it should^ not be surprising that a metric ifTTthod can successfully recover the 
original conTjguration from which the original similarities were derived. However, in 
more realistic examples, it normally is the case that substantial improvements are 
made to the metrically derived initial configuration by the non-metric algorithm 
such 9s*the one discussed previously: ^ 

. * • IP 

B. The Second Example * 3 ' • 

This worked example, differs from the previous one that the similarities* 



^ betweerpoints with which tfie scaling begins are derived from field questionnaire 
responses. The existence of a configuration is thus a hypothesis, andlts true dim^r^- 
sionality is afso unknown at the outset. The similarities data result from a paired- 
comparisons treatment of the towns chosen by a random sample of Iowa rural 
households, for major grocery expenditures [45] . This^ study is'described in gteater 
detail in ^ later section of this paper. Specifically, the data indicate.the absolite 
value of the difference from 0.5 of^the relative frequency with which towns at one 
. range of distance and belonging to a town-size-distance class are regarded as stimuli 
Since the stimuli can be decomposed into the two componentrpartf of town size 
and distance, the purpose oX the scaling approach is to det^mine the relative 
trade-off between the^o^components. flius, we wish to answer the following 
question for any^nd all comparisons. How much nearer or fufther should a town 
of a given size Begirt order to be just preferable to a second town of a given<size at a 
given distance? , 

In cQmputing these disparities, a'* problem resulting from ties in the original 
similarities is encountere'd. This problem did. pot occur in the First Example, 
though in practice it is a very commoii^one. It is solved here using the *>imary 
approach'' [43, p. 9] . In this approach, an order relation within a tie is determirijfd ' 
from the numerical order of the corresponding distances from the .configuration. 
This is in contrast 4o the "secondary approach'' in which the distances' from the 
configuration are first averaged into a block corresponding to any tied valuejs in the 
original similarities data. Sinc^e the primary approach may result in disparities that 
are Afferent, for tied values in the original data, wherfeas t)iVcahnot happen with 
' ^the' secondary a^proach^ the foVraer^is known as the weak monotonicity appj^ch. 
and the latter is knoivn as the senii-weak monotonicity approach, [43, p. 1 1] . ^ 
The orijginal similarities for this Secdhd Example are shown in Table 12 below 



-TABUEJ.2^RIGINAL SIMILARITIES 



1 

2 
-3 
4 

.5- 
'6 



coordinates: 



Q.O 


' 0.430 


0.330 


0.140 


. 0.400 


0.346 


0.430 




0.500 


0.410 ' 


0.50D 


0.470 


0.330 


d.soa 


0.0 


0.370 


0.300 


0.120 


' 0.140 


* 0.410 


0.370 


0.0 


■ 0.500 


0.390 


0.400 


0.500 


0.300", 


0.500 


. . 0.0 


^0.500 


■'0.340 


0.470 


0.1 2Q 


9.390 


0.500 


0,0 





Point 




Random 
Coordinates 




1 

■ "2 




'* '^^ 0.626 ^ 






0.940 




3 




. 0.^93 




4 




0.713 




, 5 




^ * 0.497 




6 




^ . 0.907 




The distances a«d disparities for this initial configuration are shown below. 

INITIAL DISTANCES ' 





1 2,3 


4 


5 


6 * 




1 


.314 .133 


.087 ' 


.129 


.281 * 






, a '447 


.227 ■ 


443 


.033 




• . 3 * 




!220 


.004 


414 




4 ^ 






.21.6 


.194 




5 








.4J[0 


** 


























* * » 


1 INITIAL DISPARITIES 










i ^' 2 3 


^ . 4 


5 


6 ' 






.199 .160 










• 1 


.160 


, ' .199 


'. 1 99 




2 


.447 


.199 


.443 


.199 








.199 


-.160 


.160 




4 ' . * 






*• .216 


.199 




' 5 




• 




.4J0 




6 
























, ITERATION 1 




0 - 





ERIC 



Point" 1 on Axis 1 ■" / 

2 (0.31 - 0.20) X (0.94- 0.63)/0.31 = 0.12 
3- . (0.13 ^0.^6) X (0.49- 0.6?)/0.13= 0.03 
*4 . (0.(39 - 0. 1 6) X (0.7 1 - 0.63)/0:oa = -0.07 
5» (0.13-0.20^X (0.50- 0.6'3)/0.13= 0.07 
^ . (0.28 ~ 0.20) X (0.9 1 - 0.^3)/0.^8 = 0.08 

0.94 is mean move (alpha ==-1 .00) * p 

>^^Point 2 on Axis 1 % * ' 
' 1 (0.31 -0^^(0.63- 0.94)/0.31= -0.1 2 > 
' % ■ (0.45 - 0.45) X^(0:4?,- Q.94)/0.4^ = 0.0 
4" '^^(0.23 - 0.20) X (0.7 1 -,0i>4)/Q.23 = -0.03 

5 (0.44 - 0.44) X (0.7 Ir- 0.94)/0,44 = 0.0 

6 (0.03 - 0.20) X (0.91 - d.94)/0.03= 0.17. 

0.00 is mean move (alpha = 1 .00) ■ 
Point 3 on Axis 1 

1 " (d. 1 3 - 0. 1 6) X (0.63 -.0.49)/0. 1 3 = -0.03 

2 (0.45 - 0.45) X (0.71 ^ 0.49)/0.45= 0.0 
4 (0.22 - 0.20) X (0.94 - 0,49)/0:22= 0.02 

. '5 . (0,00 - 0.16) X (0.50 - 0.49)/0.00- -0.16 . 
6 " (0.41 - 0.16) X' '{0.91 - 0.49)/0.41 = 0.25 



* O.OZis^^n move (alpha = 1 .66^ 




Point 4 on Axis 1 ^ , 

1 (0.09J-0.1j6)X (0.63 - 0.7-1 )/0.09^= 0.07 
\2 (t).23 - 0.20) )<: (6.94 - 9.71 )/0.23^ 0.03 

3 0 (0.22 - 0.20) X (0.49 - 0.7 0/0.22 = -0.02 
^^ 5 . (0,22 - 0.22) X (0.71 - 0.7 0/0.22= 0.0 ' /' 

6 (0.19^ 0.20) X (a.91 - 0.7O/0il9 = -0.00 

• ' . * V ; ■ ^ - 

^ 0.01 IS mean move (alpha = 1 .00) 

Point 5 on Axis^K 

n (0.13^.20) X/(0.'63 - 0.50)/0.1 3 =-0.07 

2 (0.44 - 0.44) X (0.94 - 0.50)/0.44 = 0.0 

. 3 • (0.00 -0.1^)X (0.49 - 0.50)/0.00'= 0.16 

. 4 (0.22 - 0.22) X (0.71 - 0.50)/0.22= 0.0 

^6 (0.41 - 0.41) X (0.91 - p.50)/0.41 = .0.0 " 

• * 0.01 is mean move (alpha = 1.00) 



Point 6 on Axis 1 

1 (0.28 - 0.20) X (0.63 - ,0.9 1 )/0.28 = .-^0.08 

2 (0.03 - 0.20) X (0.94 -.0.9 O/0.03 = -0.1 7 
^Z: (0.41 - 0. 1 6) X (0,49 - 0.9 1 )/0.4 1 = -0.25 

4 (0.19 - 0.20) X (0;71 - 0.9 0/0.19= 0.00 

5 (0.41 ^ 0> Ox. (0,50- 0.9 0/0.41 = 0.0 * 



I. CONFIGURATION 



1 



1 , 0.663 

2^ , 0.944 

3. ; 0.508 

'4 - "^0.725 

5 ^ - '0.511 

6^ ■ , '0.824 



• II,- DISTANCES ^ 

1' 2 '^^'3 .4' 5 6, 

a ,0.0 0.281 0:i5>5 0.063 0,151 0.161 

2 0.28; t 0,0 0.436 0.218 0.4"32 0.120 

.3 0.155 0.436 0.0 / , 0.217 .0.003' 0.316 

4 • 0.063 0.21-8 '0.217 , 0.0 . . d.214 ' 0.099 

5' 0?(51 ' 0.432^ .-0.003 0.214 0.0 0.313 

6 '.pj61 0.120 0.316 0.099 0.313 ' 0,0* 



ERLC 

hninniiiTirn'aaiia 



45 



00056 



9 




1 


2 


0.0 


0.026 


0'-206 


O.O 


0.155 


0.349 


0.127 


0.206 


0.157 


0.349 


O.I 57 


0.206 



L DISPARITIES 

' 3 ' 4 

OA 55 0:127 

0.349 0.206 

0.0 0.157 

^0.157 O.O 

' 0.127 0.349 

^0.127 0.157 

rV. STRESS 



5 6 

6.157 0.157 

0.349 0.206 

0.127 0.127 

0.349 0.157 

0.0 0.349 

0.349 0.0 



S. =V.ll = 0.3497 
.89 



ITERATION 2 



LCONRGURATION 



I 



1 • . ' 0.666 

2 ^ ^ 0.9l5 

3 ' ' 0.544 

4 0.741 

5 0.517, 

6 0.793 



II. DISTANCES 





c 










1 


2 


3 


4 


5 • 




0.0 


;0.249 


%Q.123 


. 0.075 


0.150 


0.127 


0.249 


' lo.o 


0.372 


0.174 


'0 399 


0.122 


0.123 


0.372 


0.0 


0.197 


• 0.027 


0.?50 


0.07i 


0.U74 


o:i97 


0.0 


0.224 


0.052 


0,1.50 


0.399 


> :0.027 


0.224 ' 


0.0 


0.277 


6.127 


0.122 


0.250 


0.052 


0.277 


0.0 






III. DISPARITIES. 






1 


2 


3' 


4 


-5 


6 



0.0 0.186 0.123 ^ 0.117 0.150* \.0.I26 

0.186 0.0 \ 0.3'18 0.174 0.318 -"0.186 

0.123 0.318 - 0.0 0ff26 0.117 0.117 

0.117 0.174 /0.126 0.0 4L. 0.318" 0.126 

0.150 6.318 '0.117 0.318* 0.0 0.318 

0.126 0.186 ^ 0.117 0.126 ' 0.318j^'0.0 



IV.STRfeSS 
Sj =0.3081 
ITgRATIONS 
L CONHGURATION 



I 



1 ' 0.681 

2 . 0.873 

3 0.617 

4 . . ^ 0.733 

5 0.508 

6 0.764 

' IL DISTANCES 





1 


2 


3 


' 4 . 


5 • 


6 


1 


0.0 


' 0.192 


0.064 


O053 


0172 • 


'0,083 


2 


0.192 


0.0 


0.256 


O140 


0365 


O109 


3 


0.064 


0.256 


- 0.0 ■ 


0117. 


0108- 


0147 


4 


0.053 


0.140 


0.117 


OO 


0225 


O.OSl 


5 


0.172 


0.365 


0.108 


"022S 


OO 


0J56 


6 


0.083 


t).109 


0.147 


(f031 


0256 


0.0 








nLTOtSPARITIES • 








1 


2 


3 


4* 


5 


6- 

> 


1 


0.0 


0.153 


^0086 


0.068 


0153' 


O086' 


2 


' 0.153 


0.0 


0275 


0.153 


0.275 


0.153 


3 


0.086 


0.275 


0.0 


0,0S6 


O086 


O086 


4 


: 0.086 


0.153 


O086 


0.0 


0.275 


0.086 


5 - 


0.j53 


0.275 


0.086 


0275 


OO / 


0.275 


6 


0.086 


0.153 




O086 


0275s 


OO 



rv. STRESS 



Sj = .230 



After five iterations, the^ue of stress was 230', no ftirther reduction in this 
value took place during the next 25 Iterations* Substantive interpretation.of the 
resulting cpnfiguration is facilitated by Figure 7 where the scal^ values are iso- 
pfethed for graphical display of the tradeoffs between the two conflicting stimuli of 
distance and town size of consumer movement. From4his'figure it is apparent that 
■'for the purposes of making grocery expenditures, towns of less than 1^00 popula- 
tion, owe to proximity much of the patronage which they receive from th^ rural 
population. In .general, the Iowa rural population substitutes stores in small towns 

er|c . ■ 



at close "distances for stores m larger towns at greater distances. The advantage of 
the result^ of the scaling procedure is that, for the first time, we are able to 
determine exactly the cir^untstances iii which one town v»ill be more favorable than 
another for any and all rural locations. 



2000 



o 



3 ' 

CL 



o 



iOOO 




* Distance to Town 
(Miles) 



Legend: Point 'numbrfk^reference the stimuli numbers in 
the tabfes in^ which scale vafues are compoted. 



Numbers*!^ boxel'^are Jcale values after the 
Bth iteration. 



Figure 7. Isopleth of Scale Vahies After the Fifth .Iteration Interpolated Pref- 
crence Stnicturc . • ' * 
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m. GEOGRAPHICAL APPUCATIONS OF MULTIDIMENSIONAL 
SCALING ANALYSIS • ^ 

The varietv of actual and potential applications of MDS is large and ^ges over 
man\ subsets of the fieW .of geograph>. Perhaps the Simplest wa> to. arran'ge 

. examples is to .divide them into those that use. simple space configuration (i.e. 

• configurations of onl> the stimulus objects) and joint space configuration 0^ those 
which map the configurations of both individuals and stimulus objects). 

A. Simple Space Outputs , 

We have constantl) stressed that MDS programs can accept both metnc aud 
non-metiiL input, it seems resonable, therefore to gjve specific examples -of each 
type of research problem. . . 

1. Map transformations, Probablv one of the most quoted examples of metnc mput 
to a MDS program with simple space Output, is Greenberg's "Roadmap*\problem 
[ f 71 . The basiw data here were the* inter<itv road mileages between ^1 pairs of 1 5 

. cities in the United States, that is, 105 or n(n-l) ii1ter<it> gfctances. The iata wiere 

'2 ^ 

, interpreted as similarities. data*b> ranking the distances with the shortest road 
ditonce represerjled b> rank I and the largest distance b> rank 105^ The output 
consisted of a configuration of points in two dimensions (north-south and eastj^ 
wesil Discrepancies between predicted configurations-obtained from a non-metnc 

* multidimensional scaling program- and actu^ configurations were for the most part 
small and. could -be accounted for b\ the simpTe fact th^ road distances ^are ffe- 
quentlN not the shortest distances between places but reflect j detouri^mmnd 
natural and nifen-made barriers. IhsQther words, locations outputted^by the program 
represent "rrue" locations if all toaOcDnnections between the pairs of crties .were 
straight lines. The solution here represents a t>pe of map transformation similatio 
that which would be achieved if the places were located on an elasfic map and 
joined bv Imes represenung actual roadwavs, and if the elastic w^re then stretched 
in each directiop until all road lines were straightened out. . .\ 

Note that in this example where metric input is used, the first step in the 
Kxu^al algorithm is to convert the metnc data to non-mctnc (ranked) form. Thus, 
'the final vonfiguratiqn erf points is obtained from non-metnc infbrmauon. this is 
true of all analyses in non-metric multidimensional scaling. How&ver, where the 
researcher can have confidence in the rnetnc information he begins with, it is often 
adv^tageous to resort to, a. metric multidimensional scaling. Tobler [54] ha? 
pointed out that the problem of constructing map 'projections is pne m Which the 
final metric is usually known to be Euclidean and the number uf dmiensions kn(twn 
to be two .\^cordingly, he argues that empirical nwp projections might be designed 
to produce, from a matrix of empiric^U distances' between points, the "best-' con- 

• figuration of the points in Iw^o dimensions such, that the sum of the squares of the 
difference between the onginal distances and the resulting configuration (inap) 
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distances are minimued. In other words, ins^tead of beginning with a definition of 
properties in the abstract that must be preserved, his procedure would be to begin 
with observed distances between points in the space and desigft a projection to best 

* rephcate these measurements. Aftei^he had computed the 2,080 sphencal divances 
between a set of 65 regularly spaced latitude and longitude intersections covenng 
the United States, Tobl^r used Torgerson's memc multidimensional scaling (57, pp. 
255-258] to denve the plane map projection coordinates for the 65 points. Conv 
panng the distances measured from this recovered configuration with the origirfal 
spheni^al distances, he showed that the distortion' values were generally less than 
two per cent. These distortion values, he concluded, compare favorably with those 
on Albers* and Lambert 6 conical projections with two standard parallels. 

In a second example of the use of scahng procedures to construct maps, Tobler, 
Mtelke, and I>etw>ler staled the geobotanical distances between New Zealand and 
some neighbonng islands using inter island dist^ces mferred from a model of the 

. diffusion of plant speaes 155): Basically, the authors attempted to examine the 
degree to which flonstic similarities between New Zealand and its neighboring 
islands (of which there are eleven m all) could be explained by two geographic 
factors, (a) the relative position of the islands and (b) the siie of the islands. The 
cntical qliestion was what proportion of thf commonality of plant species could be 
explained fully by these two factors? The model they constnigjed attempted to 
answer this question by usfng floristics relaqons^to define geobotanical distances 

. which wre then compared statistically with (he islands' relative locations on the 
siirface of the earth^In other words, they attempted to identify quantitatively the ^ 
flonstic relations of localities. They fiien used these relations (expressed by the 
number of species common :o pairs of isjands) together with island, size and 
assumed interaction between islands to draw their geobotanical map. The distances 
between islands on the map were then compared with great circle distances to give 

. some measure of the model's worth. Both the geobotanical distances and the^at 
circle distances were inputtedlnto^a multidimensional scaling program (Gutdnan- 
Lingoes SSA-l) and outputted in a two-dime nsional EucUdean space. In this way,y 
the authors obtamed an empincal map projection which preserves positional rek^ 
tions in the least squares sense more accurately than any other possible map projec- 
tion. The acmal fit of the maps to the distances wasapproxim'ately 985. In specify- 
ing the output configuration in thes^ tenns^ the authors simplified the 
* ints^p^tation of the dimensions of each configuration fgr they 'represented merely 
the north, sfxjth, east- and west dimensions common taany other map projection. 

D. G. Kendall [23) has exterrded the concept of recovering spatial coordinates 
to that of recovenng temporal order. He has shown how multidimensional scaling 
^ can be used to recover the temporal sequencing of a set of data in wiiich it is 
hypothesized that events occurring at a point in time contain information on occur 
" rences that are known tor have been "in vogue*"* over periods oF time. In his 
'example. tKe objects are tombs and the occunences relate to the* presence dr 
absence of vane ties <jf objects. The input square similarities matrix (of totnbs) is the 
number of vaneiies common to the i^^ and j*^ tombs. ^Kendall shows how with 
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*jK>th this data as well as with hypothetical data, the recovered configuration in two 
dimensons'is a horseshoe form- Ordering the tombs from the' order in ^ch they 
^^peaf as one moves around the horseshoeMeads to a seriation of the lombs that is 
stHldiltJy similar to the serial positions of the tombs derived by classical archaeo- 
logical seriation principles. This raises the intriguing possibility for historical geog- 
iaphers of designing research studies such that one might simuitajieously hope to 
recover a spatial*temporal series. Wilkinson [59] has followed up Kendall's studies 
* of seriation in archaeology using MDS by analyzing abundance matrices (i.e. 
matrices in which the values are non-negative and are weakly .iinimodal in each 
cohimn) for the occurrence 6f Hamiltonian circuits. Hamilton^ circuits are de- 
fined as re-entry paths passing throu^ each of m vertices in a linear graph only 
once. In the examples Kendall studied, "graves vs. artefacts** matrices*were com- 
pflei TOkinson argued that the length of circuit in such matrices represents the 
omianges of fashions in neighboring graves and allows for the development of 
a measi^ for the overall rate of change during a ^en period. The minimal Hamil- 
tonian circuit pves the niinimum period for a series of changes to take place. For a 
given set of similarities data, finding this circuit involves ordering the terms column- 
forcing a solution iii two dimensions, preserving the order of interpoint dis- 
t^ces well asi possible" [59, p. 14], and then observing^if there is a clear 
Hamiltonian arc. The presence of suph an arc represents a check on the seriation in 
a configjration produced by MDS methods. i 

2. Prefe^mces for poUtidans. The uses 9f MDS in political science and political 
geography afford another illustration of simple space output, but this time non- 
metric input is used. Consider a situation where subjects are asked to state their 
preferences for political candidates. In an experiment conducted on 1,000 members 
of the Consumer Mail Panels and a selection of sociology students, Johnson [21] 
askM for portions on 35 political statements and selection of two from a Est of 14 
prominent political figures. It was suggested that rankings obtained from such 

, preference data could be converted to their implied paired comparisons and, by 
add^g over sample members,, paired comparison proportions could be obtained 
wtuch were then summarized by Bradley-Terry scale values. These scale values are a 
colfection of numbers which sum to one, having the characteristic that the propor- 
tion of individuak preferring stimulus "A" from among any collection of stimuli is 
estimated, by the scale value of ''A** divided by the scale values for the whole 
collection of stimuli. * ^ ^ *^ 

The resulting similarities data could then l;e inputted into a standard MDS 
program, a configuration similar to that produced by multiple discriminant analysis, 
as shown in Rgure 8, would then .be obtained. The naming of the dimensions in 

^ Figure 8 was based on an examination of the responses which subjects made on the 
extensive questionnaire conceming policy problems. Figure 9 shows the other 
dimensions which Wiere inherient in the initial ques?tioning and^e position of each 
political figure in relation to those diihensions. Apparently, the configuration could 
be. recovered m^two dimensions with the "liberal/conservative'* and "government 
'* ♦ - ... . ' 
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involvement" axis appearing to sumniarke best the whole range of possible axes. 
From the 'final configuration, of coureg, metric distances between pairs of points 
can be calculated and statements made as to the perceived "distance apart" of 
individuals on each of the dimensions. • * . . " 
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Figure 8. Positioning of Political Figures in Terms of the Two Key Dimensions- 
(After Johnson, 1969) • ' 

3. Perceptual stu(fies. Perceptual distance is another topic of study that appears 
eminently suited to analysis by MDS techniques (with simple spa^ output). Two 
specific examples ^ the ^lledge, Briggs, Deniko study of intra-urban distances 
[14], and Whipple and Niedell's study of black and white perceptions of stores in 
Buffalo (N.Y.), [58]. - . ' 

In the study of intra-urban distances, subj^ts (all located at one point) esti- 
mated distances for the n(n-l) pairs of locatiorts selected for the study. The esti- 

• mates obtained in this w'aj^ were interpreted as dissimilarities data, and the Kruskal 
EV MDS program was used to produce a configuration based on iriterpoint dis- 
.tarices. Figure 10 shows the configuratfon of points derived from the subjects* 
estimates of location. Sjnce the scale o? the an.alysis was quite Small, considerable 
accuracy of distance estimates was obtained by some sample members. BreakJhg the 
whole sample down into twogroupshasedon length of residence, and the distances 
into subsets "toward the CBD." and "awi^Jfroni CBD" revealed some interesting 
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trends (Figure 1 1). For example, the newer of the two groups was typified by much 
poorer estimation of distances, indicating that lack of familiarity .with locations 
(iistorts their estimates of distance and can obviously influencie behavior with 
respect to these locations. Members of the second group showed improved ability* 
to^ocate all features accurately. The variance between the two groups was seen as 
an indication^.pf different familialrity levels with the city, with corresponding dif- 
ferences in the rates of forming travel habits, and differences in the choices of 
orientation nodes ^bout which mental images of the urban area were built up. 
y The other significant .feature derived from this analysis was the tendency to 
exaggerate distances toward the CBD. This in tunl suggests that increasin| conges-^' 
tionX/hd travel time) tend to increase the perceived distances between places,'and 
^^'^«\.,/>s.^,jh5tUie denser packing of land uses ar.ound the CBD makes distances appear longer 
and individuai'pjac^s harder t(>^locate precisely. ^ . * 

Conclusions drawn from this study were that interpoint distanpes which are 
over-estimated probably .reduce the likelihood of jnteraction between points. Sug- 
gestions were made 2^ to the likely effect of distorted distance perceptions on 
things such as pfaces chosen to shop,. recreate, and establish residence, ^t seems 
re^onable to assugie also that further studiSfof this type will throw considerable 
light en the relationships between perceptual accuracy and movement, and on the 
effects that changing configuration (resulting ftom information changes) haye on 
urban spatial behavior. - . ' ' 
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Figure 10. Actual and Calculated Locations of Sample^ Points, The actual map is 
compared to Ihe configuration derived from^dis^imilarity measures consisting^of 
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, Whipple and Niedell's study of black and white perceptions of various stores in 
Buffalo (N.Y.) prpvides the geographer with an interesting framework for the 
analysis of perceptual distances. The authors initially used* a semantic differential 
scale, to obtain a ranking of ten department stores (based on "favorableness") by 
black and white respondents. Frequencies of visit to each store Were also collected 
and again stores were rank-ordered for the w^ole sample. IpdividiJpal semantic dif- 
fereatial scores were transformed to a "distance" measure using the following for- 
mula: ^* 



^1. 



a= 1 



la ja 



where |d.| = absolute distance between a pair^of stores, 

* x.g - semantic score of word pair a for store (i), - * 
. and X. = semantic score of 'word pair a forfstore (i). 

The result was a distance matrix of perceived-similarities for the n(n-l) pairs of 
stores; this constituted the basic input to the MDS algorithm. The results of this 



analysis were most revealing: 



f " a) stores .that cluster togeth.er in the final configuration are more "cofnpeti- , 
tive" tjian those that are far apart; f „ ' 

b) black and white perceptions of the favorability of stores varied somewhat, 
but bverall'the perceptions were -quite similar; * 

c) both Samples did not necessarily shop at the places With the most favor- - 
able iryage; ^ ^ . f 

d) further study based on social and economic class differences showed very 
little variation i'a the perceived favorability of storesr i- . ^ ' 

yjblle thelstudy wasVidertaken in an integrated neighboi^iood and woulcf there- 
fore not exactly mirror variations in perceptions resulting from locatibnal segrega- 
tion, the methods and r^lts indicate that perceptual distanq^es between competi^ 
tors'may be Vuseful variable in consumer behavior, studies. A modification of this 
to find the perceptual distances of stores from consumers (using joint space pro- 
cedures) woifld probably be even more, useful *to the geographer. 

A further example of The use of multidimensional scaling in the simple space 

, sense is provided by Schwind [49]. Sghwinxi's interest is in the migration distances 
between states injhe United States, The basic input data are (lyadic in nature, and 

' the algorithm used is the Guttman-Lingoes smallest space analysis program. 

, Schwind generates interregional dissimilarities data on the basis of migrant moves 

. and produces a configuration of the states of the United States in which proximity 
relations are transformed somewhat on the bases of the migration inputs, Schwind 
analyzed both dyadii streams of movemenf (i.e. the net migration rates between 
* evefry pkir of state^D^n^ dyadic rates of movement. Here the ratio of net dyadic to 
gross dyadic migration is defined as: mjj = M^j ~ Mj| where: mjj is the dyadic ratio, 
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and M.J , M^^ represent the directionally oriented flows between any two states i 
andj, . 

, In this case again, only the lower half of the migration matrix was read in as 
data, and the solution was obtained under conditions of spmi-strong monotonicify 
and weak monotonicity. Results were produced for each djmehsion up to ten. 
Result? were interpreted both as dissimilarities and similarities data. Sch^nd noted 
that solutions based on similarities data had lower stress values than those based on 
dissiipaarities^data. He also inferred that a three-dimensional solution was most 
appropriate (by examining the Shepard diagram t)f the .result). Interestingly, 
Schwihd found dyadic net migration streams to be negatively associated with geo- 
graphic distance. He argued on the basis of these results that it is justifiable to treat ' 
dyadic net streams as simflarity data and dyadic net rates as dissimilarity data. The 
output from his study,inclu3ed: matrices of derived interstate migration distances 
in a space of specified ^imertsionality; the geometric coordinates of each state on 
each recovered dinienjicKtrthe distance of each point or state to the »erigin of the 
r-dimension space; arf^raphic presentation of the position of states injhe mi|ra- 
tioh space (see 'Fieurtj; 12 anf^ni • J' ^ Q- 
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' Figure 12. Migration Spaces: Similarji»s-Dat a (Source :^Sciiwind, 1971) 

* Schwind's paper is an interesting one, for it etnphapizes one of the major prob- 
lems involved 'in multidimensional scaling an^lysis-the interpretation of dimen- 

• sions. Upon examining the geometric coordinates of states on the^covered dimen- 
. sions; Schwind argued that the coordinates^did not seen) to reflect any" obvious 

scalingill^f states on the bases of incom"e,\irbanization, climate, and so on>He did 
^ not, however; attempt tQ* correlate the coordinate values witii any scale Values 




derived for the Jppropriafe explanatory vari^les. He ,did, pn th& oth?r tjand, 
attempt to interpret the laiatrices of derived distances^of states from the ori^n of ^ 
tbe migration space. Agaul his intuitive interpretation was that "distance to origin'* 
values seemed to suggest that states known for high rates of in-fhigration^are close 
to the center of a migrant's perception space, ^nd that states known for high rates 
of out-migfation are far tpward thejjeriphery. '-^ / ^ 

Another application of multidimetisibnal scaling analysis, this Uipe using a 
Kmskal-type algorithm, is seen in Gould's analysis of space preference measufes, 
with jrespect to the residential desirability' o^ various states in the United States 
(15, 16). Whereas Schwind used net migration rates between states to give him 
some indication 6f the ^similarity and dissimilarity of states, Goufd ootained pref- 
erence orderinjgs of the states from a sample of 25 resident graduate geogfaphy 
students at Pennsylvania State U;iivers!ty. In addition to-obt^ning these ordered 
data, Gould colleifed interval-scaled datar on the relajive advantages o' 
tially, Gould considered the point configucation of 51 states in a two 
Euclidean space. The arrangement of points in this space was interprete 
ing the similarity states over the range of subjecfe. The stress value, i' 
was .224 in 2 dimensions. Figures 14 and 15 indicate the point cortfi^rations with 
both interval and prdinal scale measures., The scaling devices produce 'interesting 
clusters ot states with perceived similarities/The ordinal scale produced a more 
.circular distribution of points and^onsequently one that was comparativ^y easier 
to interpret. Gould suggested tjfat the configuration resulting from tsh^ interval 

Removed to confjjrm.with copyright laws,. * " 
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Tex 



La • 

Al4 ( 



N. • 

Ark '•S, Ul , 
► ' ^'^V okU • 



s.-c..«i ^ 



• tenn 



Del 



Hon I • 
«%Uh • Uyo 



.He, • 

Mo^ . • Ohio 



N.iY 



Figure 14. Point Configuration of Q-Mode Analyls With Interval ScJales'^After 
Gould, 1969) . i : . : ' ^ ^ ^ r ^ X^^" 



Mill* 
Ark* 



• OkU 



s. ct 



,.f 1 W Va 



« * 



V. X 

H.'D» ' ' Ken 



R.I 



4» 



•N«v , Mont 

• Vyo 
•Alai * 

^ Ore«^olo 
, Kana 

' • • Waah 



% 

Htbt 



• N,0 

Minn 

* Wis - 



Mich 



Figure 1.5^ Point ponfiguration pf Q-Mode AnalySs' With OrdinabScales (After 



Gould, 



I. 



scaled data might be interpreted as a map of Amerip after it had been transformed 
Jnto "some highly distorted perceptual space." He argued that it was recognizable 
^ a map because states* that occur close together in geographic space tend t% be 
similar when perceived in terms of residential desirability. 
^ Gould then examined the problem of whether or not the interval scale configura- 
tion is simply the ordinal scale configuration which had been randomly disturbed. 
Fn order to examine this hypothesis, he measured the locational shifts between the , 
interval and ordinal point configurations, collected them at a common origin (simi- 
lar to the collection of migratory movements for the purpose of estimating a mean 
information field), and e^ined the angles of orientation." His conclusion was that' 
the distribution of angles can be considered as having been dra^ from a rectangu- ' 
* lar distribution.' He then regressed an index of social welfar^or each state against 
the pe^eptual score (§cale value) of each state. His conclusions (R2= 0.64> indi- 
cated that the pverall mentai map of the group reflects Ae variation, of rele.vant 
welfar^ measures to a high degtee. Further analysis of this regression provided ^pme 
interesting comments on the major residual values in terms of the underestimatipn 
of the^^fmagej/of certain states*. Finally, by finding th& configuration of.indiwliials 
in the sample he was able to check to see whether or ndtLn^lividuals ibcated close" 
together in his configuration had similar output configurations. for their preferences 
for states. Again he found a.high degree of correspondence betweerf the proximity 
of individuals in the configuration space and the configuration o\ states in the state 
configuration space. ^ • ' ' \ . ' 

One 5f the significant conclusions from this study was that the\prdinal produc- 
tioh of data provided h mor^easily interpreta^le and clearer confirmation than did 
the more rigorous interval scale data. * * \ 

. 4. Archaeological reconstructions.. For one final example of simple space output, we 
, can again return to the work of Tobler and Wineberg [56] . Using the (juttman- * 
Liljgoes smajlest space analysis program Tobler produced a map of pre-Hitti^e cities 
leased on^infonnation obtained from analyses of cuneiform tablets. Using so^ne of 
the assumptions inherent in the well-known gravity model, Tobler^hypothei^iz^d 
that th^ m^ore frequently a place^- was Recorded on these tablets the larger would-be. 
its'^e. Furthermore, the more frequently pairs of cities were mentioned together 
on the'same tablet, th^'greater tKe link betweeri them (either in a trade, or spatial^ 
sense). Based on these frequency counts, he compiled a>et of dis^srmilarity measures ^ 
\ arfd, using them as input, reproduced a configuration of the towns themselves. 
LSince the-^ocatlons of two of the to>yns were "known, he was able to orient his- 
output configurations in terms of latitude and longittjde and to suggest an approxi- 
mate location within ajadius of about 50 miles for the remaining, and hitherto 
unlocated, places mentione'd on the tabletsMhe essence of , thi^*study was to 
reproduce an archae^^Iogical or historico-geographif map of the location of places 
based on information iriputted from.a geographic model on their proximities. Infor- 
mation derived from the configuration may possibly then be used to choose among 
a large variety of alternative locations for archaeological e^cpeditions. Incidentally, a 



_ similar experiment attempting to reconstruct the locations of former places was 
^cunducted b> Kendally who used isiandardized inter-marnage rates for eight {Parishes 
in the Oimoot district of Oxfordshire, England between 1600 and 1850 as an index 
. of ,similant>. Using the Kfuskal MD-SCAL program he obtained a very accurate 
map ofOtmbpr-|'25] . . - 

While the above examples b> no means exhaust the range of uses for simple 
space MDS in geo^apKy. the> do give some idea of the't>pes of problems that can 
^ be examined and they refer Vo a variety of techniques that can be used In compiling 
input data. _ 

B*. Joint Space Output - ' ' - 

The use of joint space output appears, to have equall> as much potential use in 
geography as does simple space. The essence of joint spacfe output is that both 
ln4i>iduals and stimuli are mapfped into the same dimensional space. In this way, 
one caH obtain t^ie subject preference rankings and at the same time give metric 
mpaning td the distance separating individual stimuli. Concurrently, one can see 
how close to an individuaPs *'ideal*' any particular stimulus comes. 

1. Inter-urban, migration choices. One interesting use of^joifit space output can be 
^eh in*ttie work pf Demko and Briggs [8]' in their attempts to operationalize the 
choice behavior of migrants. The> argued that inter-urban migration is the outcome 
of a choice prbcess in which perceptions of the favqrability 6f each alternative 
destination is a sigmficanti'actor mflueVicing final choice. Using a sample of individ- 
uals in southeastern Ontario, they generated similarities jdata concerned with the 

^ perceptions^Of alfernative ixfban places and preferential choice data concerned with 
pr^erences for these places as migrant destinations. - 

Each individual was assumed to perceive each cityiis a union of attribute values. 
^By init|all> mapping the cities into a perceptual space (based on 'Similarities 
criteria), each^place^^ was given locational and distance characteristics somewhat 
different from ^kose it possesses in physical (objective*) space. In other words, 
places "Which have similar, coitibmations of percliived^attributes would be' close 
together in the .selecte^.r-dimensionaPspace, even though they ma> be far apart in 
objective space^ • , . . » }^ 

, The preference model fpr places is derived from a^multidimenslonal unfolding 
.procedure. Individuals rank order places on the basis of "utility? or some other 
criterion of preference, then the unfolding algorithm- defines an ideal pointVfor ea60 
individual by unfufdin^ his preferences jnid plotting the location of this ideal porpt 
in the same r-dimensional space as the similarities data were plbtteiL^?, pp. 
140-180] . Each city then hes a certain dis'tance^ from each individual, and a ohe- 

Af> dimenMuijal ordering ^>f4he relations between individuals and cities can be obtained 
Vsee Figure 16). Again additional in formation, can be obtained from the output by 
cltistenng the vitiq^us locations and intd'fj)reting which places, are likely to have 
similaVdrawing' powers for given migrant gro6pl ^ 



i?i^re 16, Preference, Space for Elmira Residents 
from Demko and Briggs (1970) , has heen removed- to 
conform with, copyright laws. It is a graph of cities 
and individual ideal points. . - v 




2>ScaIing spacf preference structures, Rushtoix has used scalrng techniques in an 
attempt to recover the nature o/ the^underl>ing trade-<?iTs between the various^ 
stknuli' that affect the iqciuonal choice of towns for consumer .expenditures by a 
sample of rural households [45] . His agproachis described in some df tail here both 
because it is illustrative of the \ise of the method of paired comparispn^ in generat- 
ing a proximity -matrix ^sjM^abie for pealing "by non-me/ric muhidimenisional scaling 
techniques^ an(Lbepatise the four coi^utational steps usjed in deriving the 
proximity maflrix nave been incorporated into an integrated computer prbgram 
[26]. Befoj^|escribing the four computational stages, however, a brief rationale, 
for thj^rese^^ner's interest in deriving such a ^cale is preseijted.-i ' , f 

ffour interest fies in V^dicting spatial choice from a set of alternatives, then we 
might view pbsery^d fhoi\;es as,the outcome'of a perceptu#|)rocess , whereby in- 
dividuals compare pj^rceivei^ alternatives with an ordering function of all conceiv- 
able opportunities so* as' to judge the most preferred alternative. Returning ^o 
reality it might thJn be argued that a sensitive treatment of the places chosen-as 
compared with th^se places pre'^nt but not chpsen-mjght leaS to the j-ecovering of 
the exact degree substitution of increases in fn^emeiTts of one variable pertain- 
ing to the places for increases in^ a second Variable relevant to the^ places. Only if 
such statements can be made for ^11 available alternatives, can we expect to predict 

.'choices from unique groups of available alternatives. The analogy with indifference 
carves in economjes.and preference structures in psychology has been made else- 

: where* [44] . In alt cases, the intent of research is to specify the function that orders 
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all conceivabl** alternatives open to the individ*^L Since this function pertains to 
the relevant stimuli, the basic problem is one of scaling the stimuli. 



a. Comffunng the input matrix. !1, Definition uf stimuli Stimuli ma> be defined at 
the outset with a sampling or other expenmental design arranged so that subjects 
are constrained to make vhoices from all alternatives, or objects might be asked to 
make vhoices from objects that are then assigned to stimulus groups. In cases where 
the researcher's pnor knowledge of relevant stimuli is vyeak, the letter design is 
.more appropriate. Rushton defined stimuli as vombinatioyis of dist^ce -separation 
between people and places, and functional complexity of places (estimated b> town 
population sizes) (Rgure 17). When one town was chosen m preference to a second, 
the generalization was made that the stimuh group ^ which the firs^ place belonged 
was revealed as being preferred to the stimuU^oup of the secoiid. The stimuli 
groups were called "ldc^tional-t>pes/' In one.^al>sis, 30 such types were defined. 
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Figure 17/ Definition of Locational Types (After Rushton, 1969 (451) 
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(2^ The basic data matrix. From a random sample (603 respondents) of the rural 
population of Iowa in 1960, infonnation was obtained on the places choseq'for 
expenditures on a riumber of commodities. Taking the place chosen for the major 
ity of expenditures for groceries, the following matrix was assembled. ^ 

TABLE 13. BASIC DATA MATRIX (HYPOTHETICAL) * 
* * Locational Type 



Household ID 


1 


2 


' 3 


4 


5 


6 , 


7 , 


. , 30 


I 


0 


*- 




« 


0 




1 


0 


2 


0 


0 




0 


0 


« 
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1 








0 


0 


.0 


• 


4 


0 




■'1 




0 


0 




0 




« 






0 


0 


-J 


« 


0 



Legend: Town in the indicated locational type: 

I p^ronized • j . . / 

* present," but not patronized 
Onot.present' ' . , 

k Computing the interpoint probabilities, A measure of the extent of preference 
for one locational type over another is the probability that one ty^ is chosen over 
another when both are present and one is chosen. Tljjs probabilitj^n be computed 
as a relative frequT^ncy by manipliIat?on of the basic -data matrix described above 
J45j.. ■ . 



c computing the interpoint proximities. From this measure pf the degree of prefer- " 
ence, a irtfeasure of perceptual, distance between locational types is required tha^^^ 
least in an ordinal sense, will indicate close; or fuilher perceptual distan^between^ 
all pairs of types. For this {jurpose, we use^^the premise jofCa'ttell that equally often 
noficed differences bci^weefl stimuh are eqyaNunles^alwai^s or never noticed. Con-, 
sequently , if one locational type is as fr^ouently preferred to a second typ6 as that 
typ^ is to the first (on the occasions wnen bo^h are present and one of them isi' 
* chosen) the overall perceptual distance between, the two types is zero. The per 
"'ceived d^tance-betNyeen any two types isfaven by: - , . , 

That IS, tKe perceptual distance from locatiorial type \ to type j is the absplute value 
of the difference from 0.5 of the conditional probability that locational type] is"^ 
jjreferred to !ype i when both are present and one is chosen. If follows that d.. = 
d.,. A matrix of such perceptual diistinces is an appropriate input matrix iox scaling 



particularly by non-metric; scaling tcjchniques, for the quality of the information is 
such that we are confident onl^ of the i;ank ordering of the interpoint distances.' 
However, Shepard (51 ) has shown that the rank ordering of all interpoint distances 
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in a matnx-implies the metric .position of the poi/its m a^pace of unknown dimen- 
si^nalitv and that, where the* nbmber of points .is large (e.g.. greater than J 5) and 
the true Jimensioft^it\ of the spate iismall* the freedT)m of movement in this 
Sf^ce is most resirivted rf the rank order of ihterpolnt distances between points in 
the metric spate is to correspond with the rjnk order in an iriput matrix. Hence, he 
argues, that metric structures are often implicit in ofdinal data. 

cL Scaling the locaiional npes. The locational t>pes were scaled bv the method of 
Kruskal (28). ind an isoplelh map of the scale values for the one-dimensional 
^lution is included ^s Figure 18. , it 

e. Significance of the recovered scale. The significance of the r£covere;|i scale is that* 
while "measured** frdm observed behavior m a spatial system of opportunities, it 
migfiPexptafm spatial choice in .a region where the density and arxangemeni of 
spatiaf opporturiiiies are different. Prefer^nc^ Scales are fundamental tiescnpt ions of 
behavior in that the> show how all hypothetical alternatives are evaluated.. Since a 
particular environment is a unique subset of the sel of hypothetical alternatives, the 
preference scale may be u;sed to evaluate this special case. It is this generally which 
an appiopnately designed preference s^ale.possesses that leads to its great potential 
in solving research problems. 

f. Inter-personal comparisons^ of scales. Ewing {12) has compared the preference 
scales of locational types for different social and economic groups of the lov^a 
households. He has ap{5lied signifLahce tests to the differences in probability values 
in the cells of an input matrix and he has .also used differences in scale values for a 
subset pf alternatives to compute the probabilfties ofinteractidn with any ahetna- 
tive. He found that the greatest difference in preference structures was between two 
grpups of , households* one of which, had^been shown to patronize the nearest 
available opportunity wKile the other was <^3mposed of h6usehol4s who by passed 
the nearest opportunity in favor of ^me other. This resulpnay be contrasted with 
that of Erniuth [11] who found r\o difference bet^jeetfUie preference scale of one 
group of urban households who clafmed^^^P' :i test question using the swnantic* 
(jiffepential) that the distance of a store was impuaant in their choice, and the scale 
of a second group who claimed that distance waj'not important.^ ' ' . , 

g. Temporal comparisons of scales. Several factors lead to temporal changes in such 
Scales. Changes in how, people evaluate alternattves are^often induced by changes in 
the character oJ[ the alternatives themselves and in the transportation system that^ 
relates, them to the alternatives^ these may lead to changed preference scales. One^ 
study has compared the Iowa prefer^nc^ scafe for grocery .purchases in 1960 wdth 
that of 1935 [46] . Major differences found in the two scales can be sufnmaFized as 
an inccpas^ijg tendency m [960 fgr Iowa households to by pass small towns at close, 
distances for larger towns farther away. Such a generalization has previously. been 



made on the baas of less formal and less quantitative research, but the precise 
calibration of the change is not possible unless the scales are computed. The growth 
t)r decline oT specific towns in this period, will depend to a. large degree on their 
position in relation to the two scales. ' ^> 

*■ , « 

3. Preference and choice In different environments. Compansons have1>een made 
between preference seated measured in.'two different environments, southwestern 
Michigan and Iowa (47) . The corwpaftsons showed, that, despite different distance- 
d^> functions for the probability of choices made at^ different distances, the 
"Y^overed preference scales are sln^^^ ^n the two areas. 

(L Jocnt'Space analysis of preference scales. A second approach to the question of 
choice in different environments is to ask whether spatial choices for like things in 
different areas ^n be* regarded as different points of view from which the two 
groups evaluate the stimuli (locational l>pes). Beginning with a matrix consisting of 
the scale rankings of the locational types for three s-ommodities in Iowa and for the 
same three in Michigan^ the .six sets of rankings we re^evaluated to determine where 
each was^ positioned in the joint space with the 30 stimuli (Table 14). The 
lucational types are defined in Figure jl 8. The two-<|imensional joint-space solution 
is sho'w^ in Figure 19, and the scatter plot of configuration distances and input 
ranks is^'showij in Figute 20. The stress value which measures the goodness-of-fit is 
0.052. Th? analvsis used the'TORSCA algorithm (61, p. 13). The close proximity 
in this preference space of five of the six points representing the groups shows that 
systematic differences in viewpoints of those five groups do not exist, rather that 
the ordering of spatial alternatives is similar in all five, cases. The sixth point, 
describing the vj^wpo'int of the Michigaif group choosing towns, for clothing 
purchases, is anomalous and deserves further study. Such differences in preference 
stnis:t\jre$ can be aHributedjo one or both of two sources. They may indicate that 
one group evaluates similar stimuli differently , for example, the resuljs above might 
mean that Iowa households* have 'a, greater propensity to' patronize small, local 
tuwns whe teas, t heir. Mis^higao counterparts might have changed 'their former habits 
and now pre?er to^ake the Idnger journey to the bigger towns. Alternatively, the 
different' preference structures might reflect the* fact that similar;sized towns in 
Miuhigan^and.Iowa might contain different amounts and types of clothing stores, 
the^i th(? observed pattern of Figurje 19 Vvoyld be a reflection of the'ambiguity. 
present in the surrogate variable *'town-size" as a measure of town content. Further 
research would clanfy these interpretations. However, the anomalous group aside, 
the tight cluster of the other five groups in the perceptual space and the accuracy 
with~ which the independently cf^mputed preference stnictijres could be recovered 
.from v^his j^mt ^space is ap indication of t|ie consistency of spatialtpre^erence , 
structures Tor different trip purposes, as well as for choice of towns inUiffefent 
areas. . . <■ • ' . ~ 
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Figure 18.«*pace Preference Structure for Grocery Purchases. Iowa 1960 (After 
Rushton, 1969(451) 

■ ■ ■ • •(.■•. : •. 

An ingenious recent application of joint-space analysis was designed to shed light 
on the problem of interpreting scaled dimension^ referred to in Section 1. In a 
study of individual rankings of U.S. states /or residential desirability, Lieber [31]* 
added to the m x n matrix of m states and n state viewpoints, Ix m objective 
measures of the srates on variables hypotheaz^d to be related to residbjitial desir- 
ability preferences. He then simultaneously scaled the m + n + 1 points |n the same 
space and interpreted interpoint distances between the state viewpoints 4nd the 
objective criteria as "a measure of the degree to, which the state viewpoint corre- 
siro'nded with the various objective criteria. Tables \5 and*16 show the resylts for 
the 17 most preferred states. ' - • 1 

A Incorporation of preference scales in diffusion models. De Temple [19] lhas 
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Figure 19. Joint Space for Consumer Spatial Choices in Michigan and Iowa. 
Legend. 30 unidentified points refer to the 30 locatibnal types. Points numbered 
1-Michigan, clothing, 2-!owa, clothing: :^-Michigan, appliances; 4-!owa, appli- 
ances; 5- Michigan, groceries; 6- Iowa, groceries ... 

argufd that a space-preference sfriicture is a rpore appropriate predictor of spatial 
interaction rates than the;more commonly used distance-d'ecay functions, since, as ' 
we staged above, the j)reference structure is mote sensitive to the unique distribu- 
tion of people or places in a giveacontextfoe temple used preference structures 
for towns selected for different commodily expenditures^generalized for the 
probabilistic, allocation, rule of Uce [36] in order to generate probabilities of 
intecaction with places-to govern the spread of the adoption of a fafirang innova- 
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tion. Formerly, distance-interaction rates^iad been used for this purpose. His opera-» 
ition^ii^ation of Jhe diffusion model is thus closer to the theoretical model first 
proposed by HSgerstrand [19]. Others experimenting with preference structures 

^ include Ewing [\2] and Girt [l3] , who have extended the application of space- 
preference structures by applying Luce's choice axiom, and Briggs [5] in his study 

. of preferences for shopping centers in Columbus, Ohio, ^ 



TABLE 14;RANK-t)RDER OF THE THIRTY LOCAT^OHAL TYPES 
' FOR THE iSIX PREFERENCE STRUCTTOES 



Lqcational ^ 
Types 



Mich. f Iowa 
ClotKng 



Mich.^^ Iowa 
Appliances , 



Mich. -Iowa 
Groceries 



•1 


16,000 .-^ 


21^000 


13,000 • 


1 

16,000 


12,000 


1 5,000 


2 


15,000 ' 


24^,000 


19,000 


21,000 


1 6,000 


2-2,000 


3 


28,000 


27^000 


^ 21,000/ 


24,000 . 


20,000 


25,000 


V 4 


27,000 


28,000 


30,000 


27,000 


29,000 


28,000 


' 5 


25,000 


29,000 


29,000 


30,000 ' 


28,000 


' 30,000 


6 • 


19,000 


17,000 


7,000 


1 1 ,000 


8,000 


•10,000 


7 


23,000. 


18,000 


15,000 


18,000 


14,000 


• 17,000 


. . * 8 


29;000 


25;,000 


17,000 


19,000 • 


23,000 


• 9,000 


9 


26,000 


22,000 


28,000' 


23,000 


22,&00 


,24,000 


i 10- , 


24;0t)0 , 


30,000 


27,000 


.29;000 


27,000 


29:'OO0 


11 


10,000 


. 8,000 


'4,000 


5,000 


4,000 


5,000 




' 6,000 ' 


i6,000. 


11,000 


9,000 


11,000 


' 13,000 


< 13, 


r8,Q00 / 


19,000 


' 16,000^ 


25,000 


17,000 " 


19,000 


14 


22,000 


23,000 


23,000 


17,000 ' 


25,000 


26,000. 


15 


20,000 


26,000 


26,000 


,28,000. 


30,000 


.27,00b\ 


' 16 


11,000 


4,000 * 


' 6,000 


8,000 


3,000' 


• 1,000 


17 


^ 7,000 


1 1,000 


12,000 


12,000 ^ 


'ifiOO 


11,000^ 


. 18 


f7,000 


.14,000 


22,000 


. 15,000 


15,000 


14,000^^^ 


19 ^ 


30,000 - 


15,000 


25,000 


• 26,000 • 


24,000 


'21,000 




21,000 


20,000 


24,000 


22,000 


26,000 ' 


23,000 


21 


• 9,000 


J, 000 


9,000 


4,000 


5,000 


' 4,000 


22 , 


- ,3,000^-^ 


^ ' 6,000 , 


i;ooo - 


6,000' • 


,,1,000 


; '6,000 




13,000, ' 


' ;^,ooo 


*1 4,000 


3,000 


13,000 


• 12,000 


, . 24 * 


14,000' 


12,000 


18,000. 


;i 3,000 


i9,ooa 


f6;ooo 


25 


12,000 


a3,(*6 


20,000^ 


20,Q00 


21,000 


2D,000 . 


26 


2,000 , 


1,000 , 


/3,000 . 


. 7,000 


18,000 


'3,000 


27' 


1,000 * 


' 2,000 


' 2,000 


2,000 ) 


^2,000 . 


^ v2,000 ' 


28 


4,000 


5,0Q0 


5,000 


• 1,000*^"^'^ 


6,000 


7,000 


^ 29 


8,000* 


7;000 


: 10,000* 


10,000 


' 10,000 


* 8,000 


' 30 


5,000 


10,000 


. 8^000 • 
» / 


M 4,000 


9,000 


18,000 . 






'i 'V 


» . / 










69 



State View ^ 

/Utah / 

Texas 

Vermont 

South Dakota 

Tennessee 
- Indiana ■ 

California 

Maryland 

Florida 



Alphabetic 
Contiguity OffefS 



TABLE 15. INTEim)INT. DISTANCE MATRIX OF GENERAL VIEWPOINT 
AND OBJECTIVE CRiTERION OF 'rtlE SEVENTEEN PREFERRED STATES 
: * ; 0>J^E PARTITIONED RANKINGS -y, , - 

^arm 
^limate 

1.3 
0.203 
173 
0,280 
0.056 
0.434 
.157 
.643- 
18P 



0.285 
1.256 
1.250 
f.204 
1.237 
1.356 
1.230 
1.444 
1.242 





Coastal/ y //^ Regional 
Recreational' Djstance Preference 

1.578 
■ 0.'808 
0:838 
0.733 
0.960 
• 0.583 
0.852 
0.375 
0.829 



TABLE 16. FREQUENCY OF CRITERION IMPORTANCE ON THE 
ST^TE VIEWPOINTS OF PREFER^ED^ATES 




1.379 
0.fl09 
0.040 
0.067 
0.1 6 r 
0.222 
0.057 
0.431 
0.032 





. Most 


Second Most 


Criterion. 

* ■ • 


ImportanVFactor 


Important Factor 


AlphabetiCi'Coiitiguity 


1 . s 


0 


Urban Offerings 


' - 1.'5 


' - . '3.5 * N 


Coastal7Recreational 


1.5 


,3 ' - 


Warm" Climate 


1 


0 


•Distance . 


1 


0 


Regional Preference • 


3 


, .1.5 


.None 


•* — 


. 1 - 


Total 


9 


9 









4. Incorporation of prefefence scales in central-place theory. Since central-place^ 
theory describes the theoretical location of busimess clilsters (scttfements) resulting 
from th^ mutually adaptive behaviqj patterns of entrepreaeOrs and consumers, 
scales that describe how one of these groups responds to actions of the othej:grou^ 
ought to be ^he fundamental postulates of the theory. But the scales that were used 
in Jthe classical statements were so simple that the.y were not commonly recognized 
as scales. Thus, from Jhe consumer's point of view, the postulate of Christaller that 
the consumet would patronize the closest place ofjif^ftng the required goods was 
essentially a scaling of relevant^^Uernativeson thf<fistance variable and postulating 
that the alternative with minimum distance sc^kfvalue would be most preferred and 
hence patronized. Described in this way, U/^ld se^ to be a natural development 
for the theoretician to question the>f?ect on the derivfedjettlement patterns of 
substituting more realistic consuijicfr preference scales foi^th^ one used by Christal- 
' ^ )licitly accepted bjMm)st researchers who have "tested" the theory since 
his work fiTsNoiinfi^ed.Thti'question has been discussed [6] , and a formal model of 



, central-place theory }ias been constructed in which the consumer preference scale* 
has beer^ varied [48] . j^Jhe results were' that, accegiing all the other postulate^, 
constraints, and environmental assumptions of Christaller, thegrouping of unique^ 

bundles' of goods in^a central-place hierarchy was no longe; ^erived property of" 
the theoretical systenu Such findings support the thesisthat preference scales are an 
important input to location theories for mutually, ^iwive behavior patterns which 
ar^a hallinark of alllocation theories. ^ " , ' ' - 

Furtfier evjAence of the usefulne^ of suchapreferences in substantive research is 
found in other work unpublished at The University of Iowa. Mr.' Ty Bell has^itsed 
trend surface equations of the scale values in a computer progran/ that estimates 
tributary populations for market centers. From a close checkerboard sample ot 
rural locatioixs, the algorithm evaluates surrounding towns and allocates the area 
surrounding the sampfliypoint to the most preferred mark et cen ter, using the equa- 

""tionjor the preference function for the activity in quest jpn. Bell is currently 
comparing the relative sites of tributary populations surrounding centers which 
Wve lost certain activities in the past, decade with centers that have supported fhe 
activity or have added it to their business structure during that period. 
' . /' — ■ 

m • ^ * 
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The overall strategy adopted in this monbgraph has'teen to present MDS as a 
usefulrtbol for.ge.ographers. ToMo this, we felt that it was necessary to explain the 
component parts of the method? to mention the types of metric and non-m$t'r4c 
: problems to Which it had beeif applied, to discuss^he nature of da^ta required for 
inpMt^^and to examine the mathematical 3l||||^?,tiire of the^ technique,' It i),eht)oves us 
now to comment on some^of the problems, mVolved in tiding the technique, and to 
elaborate on the areas of potential use of MDS analysis in.the'^discipline. 

\. Sjrtne Profebfes of Using Mds * .v^. . - v , 

^ . / ^ •* * . ' 

The problehi otf identifying dimensions^has previously been mentioned. Perhaps 
th^^os^ejious problem-relates to the **stopping'' subroutines which determine 
Mheji^^ and "fit ^ the output. This is the globa) minimum problem, 

"^^^^^^^^jfcj^ stress values at each iteration help to determine the goodness-of-fit 



between a ^rived configuration and an actual dat^set. "Sat*isfactory" stress values 
caa someticries. be^obtafned, \jthen "local mihima'' are reached; continued iteration 
mSy at^ftrst produce ai^ fncrease in stress aad then a.decr^ase whichi results in values 
considerably Ipwer than those obtained at the local^'minimum. Resulting configura- 
tions and Intenpoint dist^r^es may also change considerably in this process. Most 
^DS algorithm's check for local minima by char^ging step si^es an'd continuing the, 
iteratior^ a given number of tii)ies. For th? most part t^Vse procedures satisfy a 
researcher. An ahernative method of handling this problem is to rerun the Iterative' 
sequence be^nning with different starting, configurations and checking to\ee if 
Approximately the sanpe configuration is obtained on each)nn. However, there is no 
sure way of determiiiin^ if a glo<l!l minimum has been reached, consequejitly, the 
tethnique'should not at this stage be used to define sy.ch minima^f • 

B.vPoti^^fial^lIses of MDSJn^GeograpIiy " . „ 4^ 




^"ttfeMJOtential uSes of MDS^hich is as yet largely .unexpjored/n geOgr^ 
rapJi^J^^ljnhawJfc^^ values for input into othef anafyilcal algorithm&7 

' Perhaps, 't|| jnost ^bvfous of these uses is the application of clusteHeu^niques on^V 
MDSoiitpiut configuriR&^s. Configurations uin b^^nalyzed^in.this m^ner^ither in\ 
simple tjrjoirjt ^paceslfn the latter case, either individual or stimulus coniigur^tipns' ^ 
"fen-|ie^afnined ^ ' V ^ * . ■ <C^ - - ^ ^ 

^\W|i!e geographers ^re fr^qaently interested in the patt^h of .output cX5n|igui^.^ 
.tipjis,"^ other ^^pi^pline^ are more intere^te^ in Jiie ical(?.j^^ 
distances denized, from MDS analyses! For example, it may be' posliBle 'to us^ . 
. interpoinr (^stances as input into ^lecisiori piodel's Tn an attempt to predict chpices. 
In joint sp^Lce analyses, dist^ri^s between individuals and stimulus points ^ome 
jbe data inputs into such ^odels.wM though no studies exist as yet which combine 
characteristics of cliisters fsuch*a^ mean areal centers) and joint space di^ances, it^^ 

■ ■ *%>^. '^X^ ,<f . "SI . 



seems that thSy are eminently feasible for determining likelihoods of interaction 
between groups of individuals and specific stimulus points. \ 
*l ' Our summaries of the worl^ of Toblef, Kendall, and Wilkinson IHustrate one of 
the more creative uses of MDS analysis-that of filling in "missing data'' or speculat- 
' in| about the locations of phenomena. Archaeologists and historical geographers 
appear therefore to have at their disposal an interesting and innovative method of 
analysis fop their disciplines. / ♦ ■ ^ 

* Whether joint space or simple space procedures are used, MDS appears to have a 
multitvde*of potential u^in geography. In addition to uses such as those above, 
we suggest the'following areas of potential research: , 

1) *to stratify populations according to'specific social, economic, ethnic, or 
other characteristics in order to define-'precisely how variations in these 

/ phenomena influence perceptions of locations and attributes of things, and 
' \^ to determine the cpntrib\Uion of such stratifications to Variations in pref- 
♦ * erence rankings^jfphenomena by subjects. ^ 

2) to determine how ^istance^or ibcational characteristics of configurators, 

f change with time; this par^ly-indicates the role of learning and information ^ 
gathering in obtaining perceptual accuracy. Th^ results can^then b/ used to 
help explain a variety of s^fial behtlviors including jourfiey-to-wprk, con- 
sunder behavior, and residential site selection: — * • 
^3) to assist in translating non-metric data to metrie distance measures fo^ 
* purposes such as distinguishing clusters of like aad unlike phenomena. 
^ 4) as a potential aid in policy making by deternaining the "perceptual gaps!' 
that exist in groups of phenomena: for example, finding the locational and 
* perceptual properties of stores that should be developed'to serve minoFityi 

- and .other populations, y , , • . 

5) to experiment with the notion o/f perceptual distance, to translate it into 
metric terms, and perhaps revise our widespread use of just tWo- 
dimensional Eliclidean distancesdn explanatory models of spatial b^ehavidr, 

6) |o examine the functional relationships between perceived and "actuaP 
.distances in-order to determine tfie range over which these^distanols agree^ 
the natuii^of the relationship (i.e., whether linear pr non-linear), the rate 
at which divergence occurs as distance increases, and pthe^facts concern- 
ing the two type^oM^tance.* ' " x^"^^" " ~ 

"^^"7) to' examine tfie nature of terms 4ich as "proxjmity*' and "closeness'! in 

order t^establish tneaningful uses of the terms ^ri spatial analysis. 
*^ Thie research questions proposed [ab9V£hjiv^existed in confused sense in 
Jeeograohy fof many years, the math*«atica[t*<Kvd reviewed in this mono- 



— ^n their clarification atjid formalization, providing concepts, ternjs, 
theoreds, and empirical findings that Stimulate the researciher!? imagination. Since 
.eacliquestioin^ be ifelated to r^seawK tiiat has beep undertaken by^jebgraphers 
^S^^^|^0ew- S^egrs,, it would be pmdenf to note that in each*case the question, 
raiSeJlii^jiytim by other fomis df data analysis. .However^ expdfh 

fnce to date has indicated tRs^^iaUifu^ of "MDS approaches in thjj»ftdI^/'of 
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these quesuons ,The fact thaj,the> have been raised so recently, however, illustrates 
' how dev'elopments in basiL»anal>tic^il procedures stimulate researchers to ask new 
questions. 

The tjevelopnient of non-metric MDS has therefore provided the geographer witli- - 
co'ftcepts and techniques by which he can expect to solye some of the puzzling 
measurement problems that have impeded the developiVient dT behavioral ggpg- 
raphy. For example, geography has only just begun to research and to'rrlfeasure the 
' /t>rm of preference structures. We can surely expect that tnan's adjustment to and 
behavior in his, physical and human environment will more commonly be inter- 
preted and researched as his reaction^o a perceived s^^t of stimuli. His evaluation of . 
. these stimuli will 'become, a primary research problem. Decisions made in this 
■ environment will invreabingly be viewed as a process by which basic preferegqes are^ 
linked to perceived sets of stimuli. - . 

In addition to their role in understanding cfioic'e behavior, preference structures 
will increasingly be usedin the' normative sense of designing or controlling operat- 
ing spatial systems in order to optimize subjective preference functions. Thus the 
central importance of basic preference studies is emphasized and the accompanying 
•rofe of MDS asslred. " 

In Section I, we mentioned^the .existence of seveVal MDS algorithms. Each of 
the^ has been programmed and information regarding their availability can be 
obtained from either the Geography Computer Exchange Program at Michigan 
State University or from the authors. themselVes. a\ recent apvelopmeljit, not dis- 
cussed in Qur text is>the MINI^SA series of scaling p^pgijal^s developed by Lingoes 
an^ Ro5kamj»we h^ve not^yet examined this algorithm in detail but its structure , 
• '^ahd advantage&jire discussed by the authors^[33] . 

If! conclusion, we reiterate oyr position that although MDS is still in.a develop- 
mental stagr^it provides a useful and construcj^ive methodology for examining the- 
. probtg?Tis of preference and choice- whfch are o'f increasing concern to researchers in 
geography, and we express the hope that our treatmept of the problem will increase 
' ^ knowledge and availability of the method in the discipline. 

■•- / . ■ • . '-m' i ' 
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